Enterococcus faecalis polynucleotides and polypeptides



Field of the Invention 

The present invention relates to novel Enterococcus faecalis (E. faecalis) genes,
nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant
methods for producing the same. Further provided are diagnostic methods for
detecting Enterococcus faecalis using probes, primers, and antibodies to the E. faecalis
nucleic acids and polypeptides of the present invention. The invention further relates
to screening methods for identifying agonists and antagonists of E. faecalis
polypeptide activity and to vaccines using E. faecalis nucleic acids and polypeptides.

Background of the Invention 

Enterococci have been recognized as being pathogenic for humans since the
turn of the century, when they were first described by Thiercelin in 1899 as
microscopic organisms. The genus Enterococcus includes the species Enterococcus
faecalis, or E. faecalis, which is the most common pathogen in the group, accounting for
80 - 90 percent of all enterococcal infections. See Lewis et al. (1990) Eur. J. Clin.
Microbiol. Infect. Dis. 9:111-117.

The incidence of enterococcal infections has increased in recent years and
enterococci are now the second most frequently reported nosocomial pathogens.
Enterococcal infection is of particular concern because of its resistance to antibiotics.
Recent attention has focused on enterococci not only because of their increasing role in
nosocomial infections, but also because of their remarkable and increasing resistance to
antimicrobial agents. These factors are mutually reinforcing since resistance allows
enterococci to survive in an environment in which antimicrobial agents are heavily
used; the hospital setting provides the antibiotics which eliminate or suppress
susceptible bacteria, thereby providing a selective advantage for resistant organisms,
and the hospital also provides the potential for dissemination of resistant enterococci
via the usual routes of hand and environmental contamination.
Antimicrobial resistance can be divided into two general types: that which is an
inherent or intrinsic property and that which is acquired. The genes for intrinsic
resistance, like other species characteristics, appear to reside on the chromosome. Acquired
resistance results from either a mutation in the existing DNA or acquisition of new
DNA. The various inherent traits expressed by enterococci include resistance to
semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of
aminoglycosides, and low levels of clindamycin. Examples of acquired resistance
include resistance to chloramphenicol, erythromycin, high levels of clindamycin,
tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase,
fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without
penicillinase and resistance to fluoroquinolones are not known to be plasmid or
transposon mediated and presumably are due to mutation(s).

Although the main reservoir for enterococci in humans is the gastrointestinal 
tract, the bacteria can also reside in the gallbladder, urethra and vagina. 

E. faecalis has emerged as an important pathogen in endocarditis, bacteremia,
urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and
neonatal sepsis. See Lewis et al. (1990) supra. In the 1970s and 1980s enterococci
became firmly established as major nosocomial pathogens. They are now the fourth
leading cause of hospital-acquired infection and the third leading cause of bacteremia in
the United States. Fatality ratios for enterococcal bacteremia range from 12% to 68%,
with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori
(1993) Clin. Microbiol. Rev. 6:428-442.

The ability of enterococci to colonize the gastrointestinal tract, plus the many
intrinsic and acquired resistance traits, means that these organisms, which usually
seem to have relatively low intrinsic virulence, are given an excellent opportunity to
become secondary invaders. Since nosocomial isolates of enterococci have displayed
resistance to essentially every useful antimicrobial agent, it will likely become
increasingly difficult to successfully treat and control enterococcal infections,
particularly when the various resistance genes come together in a single strain, an
event almost certain to occur at some time in the future.

The etiology of diseases mediated or exacerbated by Enterococcus faecalis
involves the programmed expression of E. faecalis genes, and characterizing these
genes and their patterns of expression would add dramatically to our understanding of
the organism and its host interactions. Knowledge of the E. faecalis gene and genomic
organization would improve our understanding of disease etiology and lead to
improved and new ways of preventing, treating and diagnosing diseases. Thus, there
is a need to characterize the genome of E. faecalis and for polynucleotides of this
organism.

Summary of the Invention 

The present invention provides for isolated E. faecalis polynucleotides and
polypeptides shown in Table 1 and SEQ ID NO:1 through SEQ ID NO:496
(polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences
having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid
molecules comprising polynucleotides having a nucleotide sequence selected from the
group consisting of: (a) a nucleotide sequence shown in Table 1; (b) a nucleotide
sequence encoding any of the amino acid sequences of the polypeptides shown in
Table 1; and (c) a nucleotide sequence complementary to any of the nucleotide
sequences in (a) or (b). The invention further provides for fragments of the nucleic
acid molecules of (a), (b) and (c) above.

Further embodiments of the invention include isolated nucleic acid molecules
that comprise a polynucleotide having a nucleotide sequence at least 90% identical,
and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the
nucleotide sequences in (a), (b) or (c) above, or a polynucleotide which hybridizes
under stringent hybridization conditions to a polynucleotide in (a), (b) or (c) above.
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid
molecules comprising polynucleotides which encode the amino acid sequences of
epitope-bearing portions of an E. faecalis polypeptide having an amino acid sequence in
(a) above.
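
By way of illustration only, the identity thresholds recited above can be checked
with a short calculation. The sketch below assumes that the two sequences have
already been aligned to equal length by some alignment program; the function names,
the gap character, and the choice to count every aligned column in the denominator
are assumptions made for this example and are not definitions taken from the
specification.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumes the inputs are already aligned (equal length, gaps marked with `gap`).
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 90.0) -> bool:
    """True if the aligned pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned ten-base sequences differing at one position are 90%
identical and would satisfy the "at least 90% identical" criterion but not the
95% criterion.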

The present invention also relates to recombinant vectors, which include the
isolated nucleic acid molecules of the present invention, and to host cells containing
the recombinant vectors, as well as to methods of making such vectors and host cells.
The present invention further relates to the use of these vectors in the production of
E. faecalis polypeptides or peptides by recombinant techniques.

The invention further provides isolated E. faecalis polypeptides having an
amino acid sequence selected from the group consisting of an amino acid sequence of
any of the polypeptides described in Table 1 or fragments thereof.

The polypeptides of the present invention also include polypeptides having
an amino acid sequence with at least 70% similarity, and more preferably at least 75%,
80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table
1, as well as polypeptides having an amino acid sequence at least 70% identical, more
preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 95%,
96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid
molecules encoding such polypeptides.

The present invention further provides a single or multi-component vaccine
comprising one or more of the E. faecalis polynucleotides or polypeptides described
in Table 1, or fragments thereof, together with a pharmaceutically acceptable diluent,
carrier, or excipient, wherein the E. faecalis polypeptide(s) are present in an amount
effective to elicit an immune response to members of the Enterococcus genus, or at
least E. faecalis, in an animal. The E. faecalis polypeptides of the present invention
may further be combined with one or more immunogens of one or more other
Enterococcal or non-Enterococcal organisms to produce a multi-component vaccine
intended to elicit an immunological response against members of the Enterococcus
genus and, optionally, one or more non-Enterococcal organisms.

The vaccines of the present invention can be administered in a DNA form, e.g., 
"naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides 
and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA
encoding one or more polypeptides may be constructed such that these polypeptides 
are expressed as fusion proteins. 

The vaccines of the present invention may also be administered as a
component of a genetically engineered organism or host cell. Thus, a genetically
engineered organism or host cell which expresses one or more E. faecalis polypeptides
may be administered to an animal. For example, such a genetically engineered
organism or host cell may contain one or more E. faecalis polypeptides of the present
invention intracellularly, on its cell surface, or in its periplasmic space. Further, such
a genetically engineered organism or host cell may secrete one or more E. faecalis
polypeptides. The vaccines of the present invention may also be co-administered to
an animal with an immune system modulator (e.g., CD86 and GM-CSF).

The invention also provides a method of inducing an immunological response
in an animal to one or more members of the Enterococcus genus, preferably one or
more isolates of the E. faecalis species, comprising administering to the animal a
vaccine as described above.

The invention further provides a method of inducing a protective immune
response in an animal, sufficient to prevent, attenuate, or control an infection by
members of the Enterococcus genus, preferably at least the E. faecalis species,
comprising administering to the animal a composition comprising one or more of the
polynucleotides or polypeptides described in Table 1, or fragments thereof. Further,
these polypeptides, or fragments thereof, may be conjugated to another immunogen
and/or administered in admixture with an adjuvant.

The invention further relates to antibodies elicited in an animal by the 
administration of one or more E. faecalis polypeptides of the present invention and to
methods for producing such antibodies and fragments thereof. The invention further 
relates to recombinant antibodies and fragments thereof and to methods for producing 
such antibodies and fragments thereof. 

The invention also provides diagnostic methods for detecting the expression of
the polynucleotides of Table 1 by members of the Enterococcus genus in an animal.
One such method involves assaying for the expression of a polynucleotide encoding
E. faecalis polypeptides in a sample from an animal. This expression may be assayed
either directly (e.g., by assaying polypeptide levels using antibodies elicited in
response to amino acid sequences described in Table 1) or indirectly (e.g., by assaying
for antibodies having specificity for amino acid sequences described in Table 1). The
expression of polynucleotides can also be assayed by detecting the nucleic acids of
Table 1. An example of such a method involves the use of the polymerase chain
reaction (PCR) to amplify and detect Enterococcus nucleic acid sequences.

The present invention also relates to nucleic acid probes having all or part of a
nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of
hybridizing under stringent conditions to Enterococcus nucleic acids. The invention
further relates to a method of detecting one or more Enterococcus nucleic acids in a
biological sample obtained from an animal, said one or more nucleic acids encoding
Enterococcus polypeptides, comprising: (a) contacting the sample with one or more
of the above-described nucleic acid probes, under conditions such that hybridization
occurs, and (b) detecting hybridization of said one or more probes to the Enterococcus
nucleic acid present in the biological sample.

Other uses of the polypeptides of the present invention include, inter alia: to
detect E. faecalis in immunoassays, as epitope tags, as molecular weight markers on
SDS-PAGE gels, as molecular weight markers for molecular sieve gel filtration
columns, to generate antibodies that specifically bind E. faecalis polypeptides of the
present invention for the detection of E. faecalis in immunoassays, to generate an
immune response against E. faecalis and other Enterococcus species, and as vaccines
against E. faecalis, other Enterococcus species and other bacterial genera.

Isolated nucleic acid molecules of the present invention, particularly DNA
molecules, are useful as probes for gene mapping and for identifying E. faecalis in
biological samples, for instance, by Southern and Northern blot analysis.
Polynucleotides of the present invention are also useful in detecting E. faecalis by
PCR using primers for a particular E. faecalis polynucleotide. Isolated
polynucleotides of the present invention are also useful in making the polypeptides of
the present invention.

Detailed Description

The present invention relates to recombinant E. faecalis nucleic acids and
fragments thereof. The present invention further relates to recombinant E. faecalis
polypeptides and fragments thereof. The invention also relates to methods for using
these polypeptides to produce immunological responses and to confer immunological
protection to disease caused by members of the genus Enterococcus, at least isolates
of the E. faecalis species. The invention further relates to nucleic acid sequences which
encode antigenic E. faecalis polypeptides and to methods for detecting E. faecalis
nucleic acids and polypeptides in biological samples. The invention also relates to
antibodies specific for the polypeptides and peptides of the present invention and
methods for detecting such antibodies produced in a host animal.

Definitions 

The following definitions are provided to clarify the subject matter which the 
inventors consider to be the present invention. 

As used herein, the phrase "pathogenic agent" means an agent which causes a 
disease state or affliction in an animal. Included within this definition, for example,
are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a 
disease state or render an animal infected with such an organism susceptible to a 
disease state (e.g., a secondary infection). Further included are species and strains of 
the genus Enterococcus which produce disease states in animals. 

As used herein, the term "organism" means any living biological system, 
including viruses, regardless of whether it is a pathogenic agent. 

As used herein, the term "Enterococcus" means any species or strain of
bacteria which are members of the genus Enterococcus. Such species and strains are
known to those of skill in the art, and include those that are pathogenic and those that
are not.

As used herein, the phrase "one or more E. faecalis polypeptides of the
present invention" means polypeptides comprising the amino acid sequence of one or
more of the E. faecalis polypeptides described in Table 1 (even SEQ ID NOs). These
polypeptides may be expressed as fusion proteins wherein the E. faecalis
polypeptides of the present invention are linked to additional amino acid sequences
which may be of Enterococcal or non-Enterococcal origin. This phrase further
includes polypeptides comprising fragments of the E. faecalis polypeptides of the
present invention. Additional definitions are provided throughout the specification.

Explanation of Table 1 

Table 1, below, provides information describing genes which encode
polypeptides of E. faecalis. The table lists the gene identifier, which consists of the
letters EF, which denote E. faecalis, followed immediately by a three digit numeric
code, which arbitrarily numbers the E. faecalis genes of the present invention. A
number from 1 through 4 follows the three digit number. A number 1 represents the
full length open reading frame of the gene specified by the preceding three digit
number. A number 2 represents the full length polypeptide encoded by the gene
specified by the preceding three digit number. A number 3 represents a polynucleotide
fragment, of the gene represented by the preceding three digit number, used to
produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide
fragment, of the gene represented by the preceding three digit number, used to
stimulate an immune response or as a vaccine. The nucleotide and amino acid
sequences of each gene and fragment are also shown in the Sequence Listing under the
SEQ ID NO listed in Table 1.
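
The identifier scheme just described can be sketched as a small parser. This is
an illustration only: the hyphen between the three digit code and the trailing
number (e.g., "EF012-3") is an assumption, since the text above states only that
the number "follows" the code, and the names `GeneId` and `parse_gene_id` are
hypothetical.

```python
import re
from typing import NamedTuple

# Meaning of the trailing number 1 through 4, as explained for Table 1.
SUFFIX_MEANING = {
    1: "full length open reading frame",
    2: "full length polypeptide",
    3: "polynucleotide fragment used to produce an antigenic polypeptide",
    4: "antigenic polypeptide fragment",
}


class GeneId(NamedTuple):
    gene_number: int   # the arbitrary three digit gene code
    suffix: int        # 1 through 4
    description: str   # what the suffix denotes


# Assumed layout: "EF", three digits, a hyphen, then a single digit 1-4.
_PATTERN = re.compile(r"^EF(\d{3})-([1-4])$")


def parse_gene_id(identifier: str) -> GeneId:
    """Split an identifier such as 'EF012-3' into its parts."""
    match = _PATTERN.match(identifier.strip())
    if match is None:
        raise ValueError(f"not a recognized gene identifier: {identifier!r}")
    suffix = int(match.group(2))
    return GeneId(int(match.group(1)), suffix, SUFFIX_MEANING[suffix])
```

Under these assumptions, `parse_gene_id("EF012-3")` reports gene 12 and a
polynucleotide fragment used to produce an antigenic polypeptide.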

Explanation of Table 2 

Table 2 lists accession numbers for the closest matching sequences between
the polypeptides of the present invention and those available through GenBank and
Derwent databases. These reference numbers are the database entry numbers
commonly used by those of skill in the art, who will be familiar with their
denominations. The descriptions of the nomenclature for GenBank are available from
the National Center for Biotechnology Information. Column 1 lists the gene or ORF
of the present invention. Column 2 lists the accession number of a "match" gene
sequence in GenBank or Derwent databases. Column 3 lists the description of the
"match" gene sequence. Columns 4 and 5 are the high score and smallest sum
probability, respectively, calculated by BLAST. Polypeptides of the present
invention that do not share significant identity/similarity with any polypeptide
sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of
the present invention that share significant identity/similarity with more than one of
the polypeptides of GenBank and Derwent are represented more than once.

Explanation of Table 3

The E. faecalis polypeptides of the present invention may include one or more
conservative amino acid substitutions from natural mutations or human manipulation
as indicated in Table 3. Changes are preferably of a minor nature, such as conservative
amino acid substitutions that do not significantly affect the folding or activity of the
protein. Residues from the following groups, as indicated in Table 3, may be
substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small.
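
As a hedged sketch of how such substitution groups might be applied, the code
below tests whether replacing one residue with another stays within a single
group. The group names come from the paragraph above, but the residue
memberships shown are assumptions for illustration (one-letter codes following
a commonly used grouping); Table 3 itself governs the actual memberships.

```python
# Group names follow the text; the memberships are ASSUMED for illustration
# and should be replaced with the residues actually listed in Table 3.
SUBSTITUTION_GROUPS = {
    "Aromatic": set("FWY"),
    "Hydrophobic": set("LIV"),
    "Polar": set("QN"),
    "Basic": set("RKH"),
    "Acidic": set("DE"),
    "Small": set("ASTMG"),
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues differ and fall within the same group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in members and b in members
               for members in SUBSTITUTION_GROUPS.values())
```

With these assumed memberships, a leucine to isoleucine change is classed as
conservative, whereas an aspartate to lysine change is not.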

Explanation of Table 4 

Table 4 lists residues comprising antigenic epitopes of antigenic epitope-bearing
fragments present in each of the full length E. faecalis polypeptides described
in Table 1 as predicted by the inventors using the algorithm of Jameson and Wolf
(1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was
performed using the computer program PROTEAN (Version 3.11 for the Power
Macintosh, DNASTAR, Inc., 1228 South Park Street, Madison, WI). E. faecalis
polypeptides shown in Table 1 may comprise one or more antigenic epitopes comprising
residues described in Table 4. It will be appreciated that, depending on the analytical
criteria used to predict antigenic determinants, the exact address of the determinant
may vary slightly. The residues and locations described in Table 4 correspond
to the amino acid sequences for each full length gene sequence shown in Table 1 and in
the Sequence Listing. Polypeptides of the present invention that do not have
antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in
Table 4.

Selection of Nucleic Acid Sequences Encoding Antigenic E. faecalis Polypeptides

Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strain
V586. The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801
University Blvd., Manassas, VA 20110-2209, and given accession number 55969.
Some ORFs contained in the subset of fragments of the E. faecalis genome
disclosed herein were derived through the use of a number of screening criteria detailed
below. The ORFs are bounded at the amino terminus by a methionine or valine
residue and usually at the carboxy terminus by a stop codon.

Most of the selected sequences consist of complete ORFs. The polypeptides
that do not comprise a complete ORF can be identified by determining whether the
corresponding polynucleotide sequence comprises a stop codon after the codon for
the last amino acid residue in the polypeptide sequence. It is not always preferred to
express a complete ORF in a heterologous system. It may be challenging to express
and purify a highly hydrophobic protein by common laboratory methods. Some of
the polypeptide vaccine candidates described herein have been modified slightly to
simplify the production of recombinant protein. For example, nucleotide sequences
which encode highly hydrophobic domains, such as those found at the amino terminal
signal sequence, have been excluded from some constructs used for expression of the
polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring
at the carboxy terminus have also been excluded from the recombinant expression
constructs. Thus, in one embodiment, a polypeptide which represents a truncated or
modified ORF may be used as an antigen.
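
The stop codon test mentioned above, for deciding whether a polypeptide
corresponds to a complete ORF, can be sketched as follows. The sketch assumes
that the reading frame starts at the first nucleotide of the polynucleotide and
that the three standard stop codons apply; the function name is hypothetical.

```python
# Standard stop codons (DNA alphabet); assumed to apply to the sequence at hand.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_stop_after_last_residue(polynucleotide: str, polypeptide_length: int) -> bool:
    """Check for a stop codon right after the codon of the last amino acid.

    Assumes codon 1 begins at nucleotide position 1 of `polynucleotide`.
    """
    seq = polynucleotide.upper().replace("U", "T")  # accept RNA input as well
    start = 3 * polypeptide_length                  # 0-based index of next codon
    return seq[start:start + 3] in STOP_CODONS
```

For instance, a nine-base sequence "ATGAAATAA" paired with a two-residue
polypeptide carries a stop codon immediately after the second codon, so the
polypeptide would be treated as ending at the natural terminus of the ORF.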

While numerous methods are known in the art for selecting potentially
immunogenic polypeptides, many of the ORFs disclosed herein were selected on the
basis of screening Enterococcus faecalis ORFs for several aspects of potential
immunogenicity. One set of selection criteria is as follows:

1. Type I signal sequence: An amino terminal type I signal sequence generally
directs a nascent protein across the plasma and outer membranes to the exterior of the
bacterial cell. Experimental evidence obtained from studies with Escherichia coli
suggests that the typical type I signal sequence consists of the following biochemical
and physical attributes (Izard, J. W. and Kendall, D. A., Mol. Microbiol. 13:765-773
(1994)). The length of the type I signal sequence is approximately 15 to 25 primarily
hydrophobic amino acid residues with a net positive charge in the extreme amino
terminus. In addition, the central region of the signal sequence adopts an alpha-helical
conformation in a hydrophobic environment. Finally, the region surrounding the
actual site of cleavage is ideally six residues long, with small side-chain amino acids in
the -1 and -3 positions.

2. Type IV signal sequence: The type IV signal sequence is an example of the
several types of functional signal sequences which exist in addition to the type I signal
sequence detailed above. Although functionally related, the type IV signal sequence
possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory,
S., J. Bacteriol. 174:7345-7351 (1992)). These are typically six to eight amino acids
with a net basic charge followed by an additional sixteen to thirty primarily
hydrophobic residues. The cleavage site of a type IV signal sequence is typically after
the initial six to eight amino acids at the extreme amino terminus. In addition, type IV
signal sequences generally contain a phenylalanine residue at the +1 site relative to the
cleavage site.

3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial
lipoprotein precursors have allowed the definition of a consensus amino acid sequence
for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors
examined contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to
the point of cleavage (Hayashi, S. and Wu, H. C., J. Bioenerg. Biomembr. 22:451-471
(1990)).

4. LPXTG motif: It has been experimentally determined that most anchored
proteins found on the surface of gram-positive bacteria possess a highly conserved
carboxy terminal sequence. More than fifty such proteins from organisms such as S.
pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based
on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V.
A., ASM News 62:405-410 (1996)). The conserved region consists of six charged
amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino
acids presumed to function as a transmembrane domain. Immediately adjacent to the
transmembrane domain is a six amino acid sequence conserved in nearly all proteins
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any
amino acid.
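
Two of the foregoing criteria are simple sequence patterns and can be sketched
as regular expression searches over a one-letter amino acid sequence. This is a
minimal illustration, not the selection algorithm used by the inventors: it
looks only for the L-P-X-T-G core and the L-(A,S)-(G,A)-C lipoprotein consensus
and ignores the charge, hydrophobicity, and positional requirements described
above. The function names are hypothetical.

```python
import re

# Lookahead patterns so that overlapping occurrences are all reported.
_LPXTG = re.compile(r"(?=LP.TG)")        # L-P-X-T-G, X = any residue
_LIPOBOX = re.compile(r"(?=L[AS][GA]C)") # L-(A,S)-(G,A)-C at positions -3 to +1


def find_lpxtg(protein: str) -> list[int]:
    """1-based positions of the leucine starting each LPXTG core."""
    return [m.start() + 1 for m in _LPXTG.finditer(protein.upper())]


def find_lipoprotein_cysteines(protein: str) -> list[int]:
    """1-based positions of the +1 cysteine in each consensus match.

    Cleavage would fall immediately before the reported cysteine.
    """
    return [m.start() + 4 for m in _LIPOBOX.finditer(protein.upper())]
```

A practical screen would additionally require the LPXTG core to lie near the
carboxy terminus, followed by a hydrophobic stretch and a charged tail, and the
lipoprotein consensus to lie within an amino terminal signal sequence.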

An algorithm for selecting antigenic and immunogenic Enterococcus faecalis
polypeptides including the foregoing criteria was developed. The algorithm is similar
to that described in U.S. patent application 08/781,986, filed January 3, 1997, which
is fully incorporated by reference herein. Use of the algorithm by the inventors to
select immunologically useful Enterococcus faecalis polypeptides resulted in the
selection of a number of the disclosed ORFs. Polypeptides comprising the
polypeptides identified in this group may be produced by techniques standard in the
art and as further described herein.

Nucleic Acid Molecules

Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strain V586. As
discussed elsewhere herein, polynucleotides of the present invention readily may be
obtained by routine application of well known and standard procedures for cloning
and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are
provided below, for instance. A wide variety of Enterococcus faecalis strains can
be used to prepare E. faecalis genomic DNA for cloning and for obtaining
polynucleotides and polypeptides of the present invention. A wide variety of
Enterococcus faecalis strains are available to the public from recognized depository
institutions, such as the American Type Culture Collection (ATCC). It is recognized
that minor variation in the nucleic acid and amino acid sequence may be expected from
E. faecalis strain to strain. The present invention provides for genes, including both
polynucleotides and polypeptides, of the present invention from all the
Enterococcus faecalis strains.

Unless otherwise indicated, all nucleotide sequences determined by sequencing
a DNA molecule herein were determined using an automated DNA sequencer (such as
the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid
sequences of polypeptides encoded by DNA molecules determined herein were
predicted by translation of a DNA sequence determined as above. Therefore, as is
known in the art for any DNA sequence determined by this automated approach, any
nucleotide sequence determined herein may contain some errors. Nucleotide
sequences determined by automation are typically at least about 90% identical, more
typically at least about 95% to at least about 99.9% identical to the actual nucleotide
sequence of the sequenced DNA molecule. The actual sequence can be more
precisely determined by other approaches including manual DNA sequencing methods
well known in the art. As is also known in the art, a single insertion or deletion in a
determined nucleotide sequence compared to the actual sequence will cause a frame
shift in translation of the nucleotide sequence such that the predicted amino acid
sequence encoded by a determined nucleotide sequence will be completely different
from the amino acid sequence actually encoded by the sequenced DNA molecule,
beginning at the point of such an insertion or deletion. In case of conflict between
Table 1 and either the nucleic acid sequence of the clones listed in Table 1 or the amino
acid sequence of the protein expressed by the clones listed in Table 1, the clones listed
in Table 1 are controlling. By "nucleotide sequence" of a nucleic acid molecule or
polynucleotide is intended to mean either a DNA or RNA sequence.

Using the information provided herein, such as the nucleotide sequence in Table 1,
a nucleic acid molecule of the present invention encoding an E. faecalis polypeptide
may be obtained using standard cloning and screening procedures, such as those for
cloning DNAs using genomic DNA as starting material. See, e.g., Sambrook et al.,
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed.
1989); Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY
(John Wiley and Sons, N.Y. 1989). Illustrative of the invention, the nucleic acid
molecule described in Table 1 was discovered in a DNA library derived from E.
faecalis genomic DNA.
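
The frame shift effect noted above, in which a single inserted or deleted base
changes every downstream codon, can be illustrated with a short translation
sketch. The example sequence is invented for illustration and is not taken from
Table 1; the standard genetic code is assumed, and the function names are
hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def translate(dna: str) -> str:
    """Translate from position 1 in frame; a trailing partial codon is ignored."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def insert_base(dna: str, after_position: int, base: str) -> str:
    """Insert one base after the given 1-based nucleotide position."""
    return dna[:after_position] + base + dna[after_position:]


original = "ATGGCTAAAGGTCTG"            # invented 15-base example
shifted = insert_base(original, 6, "C")  # single-base insertion after base 6

print(translate(original))  # MAKGL
print(translate(shifted))   # MAQRS
```

The first two residues agree, and every residue from the point of insertion
onward differs, which is why a single sequencing error of this kind alters the
whole predicted downstream amino acid sequence.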

Nucleic acid molecules of the present invention may be in the form of RNA,
such as mRNA, or in the form of DNA, including, for instance, DNA and genomic
DNA obtained by cloning or produced synthetically. The DNA may be
double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding
strand, also known as the sense strand, or it may be the non-coding strand, also
referred to as the anti-sense strand.

By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule,
DNA or RNA, which has been removed from its native environment. This includes
segments of DNA comprising the E. faecalis polynucleotides of the present invention
isolated from the native chromosome. These fragments include both isolated
fragments consisting only of E. faecalis DNA and fragments comprising heterologous
sequences such as vector sequences or other foreign DNA. For example, recombinant
DNA molecules contained in a vector are considered isolated for the purposes of the
present invention. Further examples of isolated DNA molecules include recombinant
DNA molecules maintained in heterologous host cells or purified (partially or
substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or
in vitro RNA transcripts of the DNA molecules of the present invention. Isolated
nucleic acid molecules according to the present invention further include such
molecules produced synthetically.
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In addition, isolated nucleic acid molecules of the invention include DNA 
molecules which comprise a sequence substantially different from those described 
above but which, due to the degeneracy of the genetic code, still encode a E.faecalis 
polypeptides and peptides of the present invention (e.g. polypeptides of Table 1). 

5 That is, all possible DNA sequences that encode the E. faecalis polypeptides of the 
present invention. This includes the genetic code and species-specific codon 
preferences known in the art. Thus, it would be routine for one skilled in the art to 
generate the degenerate variants described above, for instance, to optimize codon 
expression for a particular host (e.g., change codons in the bacteria mRNA to those 

1 0 preferred by a mammalian or other bacterial host such as E. coli). 
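
The degeneracy described above can be illustrated with a minimal Python sketch. It counts how many DNA sequences encode a given peptide under the standard genetic code and back-translates a peptide using one codon per amino acid. The function names, the example peptide, and the small PREFERRED_CODONS table are illustrative assumptions, not part of the disclosure; a real codon-usage table for the chosen host would be substituted.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}

# Hypothetical host preference table, limited to four amino acids for brevity.
# Each entry is a valid codon for its amino acid; the choice is an assumption.
PREFERRED_CODONS = {"M": "ATG", "K": "AAA", "L": "CTG", "W": "TGG"}


def count_degenerate_variants(peptide: str) -> int:
    """Count the distinct coding sequences that encode the peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


def back_translate(peptide: str, table: dict[str, str] = PREFERRED_CODONS) -> str:
    """Build one coding sequence using a single chosen codon per residue."""
    return "".join(table[aa] for aa in peptide.upper())


print(count_degenerate_variants("MKLW"))  # 1 * 2 * 6 * 1 = 12
print(back_translate("MKLW"))             # ATGAAACTGTGG
```

Even a four-residue peptide has a dozen encoding sequences here, which is why the variants are described by rule rather than listed.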

The invention further provides isolated nucleic acid molecules having the
nucleotide sequence shown in Table 1 or a nucleic acid molecule having a sequence
complementary to one of the above sequences. Such isolated molecules, particularly
DNA molecules, are useful as probes for gene mapping and for identifying E. faecalis
in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other
form of hybridization analysis.

The present invention is further directed to nucleic acid molecules encoding
portions or fragments of the nucleotide sequences described herein. Fragments include
portions of the nucleotide sequences of Table 1, or the E. faecalis nucleotide
sequences contained in the plasmid clones listed in Table 1, at least 10 contiguous
nucleotides in length selected from any two integers, one of which represents a 5'
nucleotide position and a second of which represents a 3' nucleotide position, where
the first nucleotide for each nucleotide sequence in Table 1 is position 1. That is,
every combination of a 5' and 3' nucleotide position that a fragment at least 10
contiguous nucleotides in length could occupy is included in the invention. "At least"
means a fragment may be 10 contiguous nucleotide bases in length or any integer
between 10 and the length of an entire nucleotide sequence of Table 1 minus 1.
Therefore, included in the invention are contiguous fragments specified by any 5' and
3' nucleotide base positions of a nucleotide sequence of Table 1 wherein the length
of the contiguous fragment is any integer between 10 and the length of an entire
nucleotide sequence minus 1.
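
The position-based definition of fragments can be made concrete with a short Python sketch. It counts every 5'/3' position pair that yields a fragment of at least 10 contiguous nucleotides and at most the full length minus 1, and extracts one such fragment using 1-based inclusive positions. The function names and the 12-nucleotide example sequence are hypothetical and serve only as an illustration.

```python
def count_fragments(seq_len: int, min_len: int = 10) -> int:
    """Count (5', 3') position pairs giving fragments of min_len..seq_len-1 nt."""
    # A fragment of length k can start at seq_len - k + 1 different positions.
    return sum(seq_len - k + 1 for k in range(min_len, seq_len))


def fragment(seq: str, five_prime: int, three_prime: int, min_len: int = 10) -> str:
    """Return the fragment spanning 1-based positions five_prime..three_prime."""
    length = three_prime - five_prime + 1
    if five_prime < 1 or three_prime > len(seq):
        raise ValueError("positions fall outside the sequence")
    if not (min_len <= length <= len(seq) - 1):
        raise ValueError("fragment length must be between min_len and len(seq) - 1")
    return seq[five_prime - 1:three_prime]


example = "ACGTACGTACGT"           # hypothetical 12-nt sequence, position 1 = 'A'
print(count_fragments(len(example)))  # 3 fragments of 10 nt + 2 of 11 nt = 5
print(fragment(example, 2, 11))       # CGTACGTACG
```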

Further, the invention includes polynucleotides comprising fragments specified
by size, in nucleotides, rather than by nucleotide positions. The invention includes
any fragment size, in contiguous nucleotides, selected from integers between 10 and
the length of an entire nucleotide sequence minus 1. Preferred sizes of contiguous
nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, and 50
nucleotides. Other preferred sizes of contiguous nucleotide fragments, which may be
useful as diagnostic probes and primers, include fragments 50-300 nucleotides in
length which include, as discussed above, fragment sizes representing each integer
between 50-300. Larger fragments are also useful according to the present invention
corresponding to most, if not all, of the nucleotide sequences shown in Table 1 or of
the E. faecalis nucleotide sequences of the plasmid clones listed in Table 1. The
preferred sizes are, of course, meant to exemplify, not limit, the present invention as
all size fragments, representing any integer between 10 and the length of an entire
nucleotide sequence minus 1, are included in the invention. Additional preferred
nucleic acid fragments of the present invention include nucleic acid molecules encoding
epitope-bearing portions of E. faecalis polypeptides identified in Table 4.

The present invention also provides for the exclusion of any fragment,
specified by 5' and 3' base positions or by size in nucleotide bases as described above
for any nucleotide sequence of Table 1 or the plasmid clones listed in Table 1. Any
number of fragments of nucleotide sequences in Table 1 or the plasmid clones listed in
Table 1, specified by 5' and 3' base positions or by size in nucleotides, as described
above, may be excluded from the present invention.

In another aspect, the invention provides an isolated nucleic acid molecule
comprising a polynucleotide which hybridizes under stringent hybridization
conditions to a portion of a polynucleotide in a nucleic acid molecule of the invention
described above, for instance, nucleotide sequences of Table 1 or the E. faecalis
sequences of the plasmid clones listed in Table 1. By "stringent hybridization
conditions" is intended overnight incubation at 42°C in a solution comprising: 50%
formamide, 5x SSC (1x SSC = 150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium
phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 µg/ml
denatured, sheared salmon sperm DNA, followed by washing the filters in 0.1x SSC at
about 65°C.
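
As a worked illustration of the salt concentrations involved, the following Python sketch scales the conventional 1x SSC composition (assumed here to be 150 mM NaCl and 15 mM trisodium citrate) to the 5x hybridization solution and the 0.1x wash. The names are hypothetical and the recipe in actual use should be checked.

```python
# Conventional 1x SSC composition in mM (an assumption; verify against the recipe used).
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(fold: float) -> dict[str, float]:
    """Return component concentrations (mM) for an n-fold SSC solution."""
    return {name: round(fold * conc, 3) for name, conc in SSC_1X_MM.items()}


print(ssc_concentrations(5))    # hybridization: 750 mM NaCl, 75 mM citrate
print(ssc_concentrations(0.1))  # wash: 15 mM NaCl, 1.5 mM citrate
```

The low salt of the 0.1x wash, together with the 65°C temperature, is what makes the wash step stringent.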

By a polynucleotide which hybridizes to a "portion" of a polynucleotide is
intended a polynucleotide (either DNA or RNA) hybridizing to at least about 15
nucleotide bases, and more preferably at least about 20 nucleotide bases, still more
preferably at least about 30 nucleotide bases, and even more preferably about 30-70
(e.g., 50) nucleotide bases of the reference polynucleotide. These are useful as
diagnostic probes and primers as discussed above. By a portion of a polynucleotide
of "at least 20 nucleotide bases in length," for example, is intended 20 or more
contiguous nucleotide bases from the nucleotide sequence of the reference
polynucleotide (e.g., the nucleotide sequence as shown in Table 1). Portions of a
polynucleotide which hybridizes to a nucleotide sequence in Table 1, which can be
used as probes and primers, may also be precisely specified by 5' and 3' base
positions or by size in nucleotide bases as described above or precisely excluded in the
same manner.

The nucleic acid molecules of the present invention include those encoding the
full length E. faecalis polypeptides of Table 1 and portions of the E. faecalis
polypeptides of Table 1. Also included in the present invention are nucleic acids
which encode the above full length sequences and further comprise additional sequences,
such as those encoding an added secretory leader sequence, such as a pre-, or pro- or
prepro- protein sequence. Further included in the present invention are nucleic acids
which encode the above full length sequences and portions thereof and further comprise
additional heterologous amino acid sequences encoded by nucleic acid sequences from
a different source.

Also included in the present invention are nucleic acids encoding the above
protein sequences together with additional, non-coding sequences, including, for
example, but not limited to, non-coding 5' and 3' sequences. These sequences include
transcribed, non-translated sequences that may play a role in transcription and
mRNA processing, for example, ribosome binding and stability of mRNA. Also
included in the present invention are additional coding sequences which provide
additional functionalities.

Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker 
sequence, such as a sequence encoding a peptide which facilitates purification of the 
fused polypeptide. In certain preferred embodiments of this aspect of the invention, 
the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in
a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among
others, many of which are commercially available. For instance, hexa-histidine
provides for convenient purification of the fusion protein. See Gentz et al. (1989)
Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for
purification which corresponds to an epitope derived from the influenza hemagglutinin
protein. See Wilson et al. (1984) Cell 37:767. As discussed below, other such fusion
proteins include the E. faecalis polypeptides of the present invention fused to Fc at 
the N- or C-terminus. 

Variant and Mutant Polynucleotides 

The present invention further relates to variants of the nucleic acid molecules
which encode portions, analogs or derivatives of the E. faecalis polypeptides of Table 1
and variant polypeptides thereof including portions, analogs, and derivatives of the E.
faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant.
By an "allelic variant" is intended one of several alternate forms of a gene occupying a 
given locus on a chromosome of an organism. See, e.g., B. Lewin, Genes IV (1990). 
Non-naturally occurring variants may be produced using art-known mutagenesis 
techniques. 

Such nucleic acid variants include those produced by nucleotide substitutions,
deletions, or additions. The substitutions, deletions, or additions may involve one or
more nucleotides. The variants may be altered in coding regions, non-coding regions,
or both. Alterations in the coding regions may produce conservative or
non-conservative amino acid substitutions, deletions or additions. Especially
preferred among these are silent substitutions, additions and deletions, which do not
alter the properties and activities of an E. faecalis protein of the present invention or
portions thereof. Also especially preferred in this regard are conservative
substitutions.

Such polypeptide variants include those produced by amino acid
substitutions, deletions or additions. The substitutions, deletions, or additions may
involve one or more residues. Alterations may produce conservative or
non-conservative amino acid substitutions, deletions, or additions. Especially
preferred among these are silent substitutions, additions and deletions, which do not
alter the properties and activities of an E. faecalis protein of the present invention or
portions thereof. Also especially preferred in this regard are conservative
substitutions.

The present invention also relates to recombinant vectors, which include the 
isolated nucleic acid molecules of the present invention, and to host cells containing 
the recombinant vectors, as well as to methods of making such vectors and host cells 
and for using them for production of E. faecalis polypeptides or peptides by
recombinant techniques.

The present application is directed to nucleic acid molecules at least 90%,
95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1.
The above nucleic acid sequences are included irrespective of whether they encode a
polypeptide having E. faecalis activity. This is because even where a particular
nucleic acid molecule does not encode a polypeptide having E. faecalis activity, one of
skill in the art would still know how to use the nucleic acid molecule, for instance, as a
hybridization probe. Uses of the nucleic acid molecules of the present invention that
do not encode a polypeptide having E. faecalis activity include, inter alia, isolating an
E. faecalis gene or allelic variants thereof from a DNA library, and detecting E. faecalis
mRNA expression in samples (e.g., environmental samples) suspected of containing E.
faecalis by Northern blot analysis.

Preferred are nucleic acid molecules having sequences at least 90%, 95%, 96%,
97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do,
in fact, encode a polypeptide having E. faecalis protein activity. By "a polypeptide
having E. faecalis activity" is intended polypeptides exhibiting activity similar, but
not necessarily identical, to an activity of the E. faecalis protein of the invention, as
measured in a particular biological assay suitable for measuring activity of the
specified protein.

Due to the degeneracy of the genetic code, one of ordinary skill in the art will
immediately recognize that a large number of the nucleic acid molecules having a
sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid
sequences shown in Table 1 will encode a polypeptide having E. faecalis protein
activity. In fact, since degenerate variants of these nucleotide sequences all encode the
same polypeptide, this will be clear to the skilled artisan even without performing the
above described comparison assay. It will be further recognized in the art that, for
such nucleic acid molecules that are not degenerate variants, a reasonable number will
also encode a polypeptide having E. faecalis protein activity. This is because the
skilled artisan is fully aware of amino acid substitutions that are either less likely or
not likely to significantly affect protein function (e.g., replacing one aliphatic amino
acid with a second aliphatic amino acid), as further described below.

The biological activity or function of the polypeptides of the present
invention is expected to be similar or identical to that of polypeptides from other
bacteria that share a high degree of structural identity/similarity. Table 2 lists
accession numbers and descriptions for the closest matching sequences of polypeptides
available through Genbank and Derwent databases. It is therefore expected that the
biological activity or function of the polypeptides of the present invention will be
similar or identical to those polypeptides from other bacterial genera, species, or
strains listed in Table 2.

By a polynucleotide having a nucleotide sequence at least, for example, 95%
"identical" to a reference nucleotide sequence of the present invention, it is intended
that the nucleotide sequence of the polynucleotide is identical to the reference
sequence except that the polynucleotide sequence may include up to five point
mutations per each 100 nucleotides of the reference nucleotide sequence encoding the
E. faecalis polypeptide. In other words, to obtain a polynucleotide having a
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to
5% of the nucleotides in the reference sequence may be deleted, inserted, or
substituted with another nucleotide. The query sequence may be an entire sequence
shown in Table 1, the ORF (open reading frame), or any fragment specified as
described herein.
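
The arithmetic behind the "at least 95% identical" definition can be sketched in a few lines of Python. The function name is hypothetical, and rounding down to a whole number of mutations is an assumption made for the illustration.

```python
def max_point_mutations(reference_length: int, percent_identity: int) -> int:
    """Largest whole number of point mutations that keeps the stated identity."""
    # Integer arithmetic avoids floating-point rounding surprises.
    return reference_length * (100 - percent_identity) // 100


print(max_point_mutations(100, 95))   # 5 mutations per 100 nucleotides
print(max_point_mutations(1000, 99))  # 10
```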

As a practical matter, whether any particular nucleic acid molecule or
polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide
sequence of the present invention can be determined conventionally using known
computer programs. A preferred method for determining the best overall match
between a query sequence (a sequence of the present invention) and a subject
sequence, also referred to as a global sequence alignment, can be determined using the
FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et
al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and
subject sequences are both DNA sequences. An RNA sequence can be compared by
first converting U's to T's. The result of said global sequence alignment is in percent
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=1,
Joining Penalty=30, Randomization Group Length=0, Cutoff Score=1, Gap
Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject
nucleotide sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence because of 5' or 3'
deletions, not because of internal deletions, a manual correction must be made to the
results. This is because the FASTDB program does not account for 5' and 3'
truncations of the subject sequence when calculating percent identity. For subject
sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent
identity is corrected by calculating the number of bases of the query sequence that are
5' and 3' of the subject sequence, which are not matched/aligned, as a percent of the
total bases of the query sequence. Whether a nucleotide is matched/aligned is
determined by results of the FASTDB sequence alignment. This percentage is then
subtracted from the percent identity, calculated by the above FASTDB program using
the specified parameters, to arrive at a final percent identity score. This corrected
score is what is used for the purposes of the present invention. Only nucleotides
outside the 5' and 3' nucleotides of the subject sequence, as displayed by the
FASTDB alignment, which are not matched/aligned with the query sequence, are
calculated for the purposes of manually adjusting the percent identity score.

For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide
query sequence to determine percent identity. The deletions occur at the 5' end of the
subject sequence and therefore, the FASTDB alignment does not show a
match/alignment of the first 10 nucleotides at the 5' end. The 10 unpaired nucleotides
represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not
matched/total number of nucleotides in the query sequence) so 10% is subtracted from
the percent identity score calculated by the FASTDB program. If the remaining 90
nucleotides were perfectly matched the final percent identity would be 90%. In
another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide
query sequence. This time the deletions are internal deletions so that there are no
nucleotides on the 5' or 3' of the subject sequence which are not matched/aligned with
the query. In this case the percent identity calculated by FASTDB is not manually
corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are
not matched/aligned with the query sequence are manually corrected for. No other
manual corrections are to be made for the purposes of the present invention.
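
The manual correction described above reduces to a single subtraction, sketched here in Python. The function and parameter names are hypothetical. The percent identity passed in is whatever the alignment program reported, and the unmatched counts are assumed to have been read off the displayed alignment.

```python
def corrected_percent_identity(
    reported_identity: float,
    query_length: int,
    unmatched_5prime: int = 0,
    unmatched_3prime: int = 0,
) -> float:
    """Subtract the share of query bases lying outside the subject's 5'/3' ends."""
    overhang_percent = 100.0 * (unmatched_5prime + unmatched_3prime) / query_length
    return reported_identity - overhang_percent


# First example from the text: 90-nt subject vs. 100-nt query, 10 query bases
# unmatched at the 5' end, remaining 90 bases perfectly matched.
print(corrected_percent_identity(100.0, 100, unmatched_5prime=10))  # 90.0

# Second example: the deletions are internal, so nothing is subtracted and the
# reported score (a hypothetical 90.0 here) is used unchanged.
print(corrected_percent_identity(90.0, 100))  # 90.0
```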

Vectors and Host Cells

The present invention also relates to vectors which include the isolated DNA
molecules of the present invention, host cells comprising the recombinant vectors, and
the production of E. faecalis polypeptides and peptides of the present invention
expressed by the host cells.

Recombinant constructs may be introduced into host cells using well known
techniques such as infection, transduction, transfection, transvection, electroporation
and transformation. The vector may be, for example, a phage, plasmid, viral or
retroviral vector. Retroviral vectors may be replication competent or replication
defective. In the latter case, viral propagation generally will occur only in
complementing host cells.

The polynucleotides may be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line
and then transduced into host cells.

Preferred are vectors comprising cis-acting control regions to the
polynucleotide of interest. Appropriate trans-acting factors may be supplied by the
host, supplied by a complementing vector or supplied by the vector itself upon
introduction into the host.

In certain preferred embodiments in this regard, the vectors provide for
specific expression, which may be inducible and/or cell type-specific. Particularly
preferred among such vectors are those inducible by environmental factors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors useful in the present invention include chromosomal-,
episomal- and virus-derived vectors, e.g., vectors derived from bacterial plasmids,
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses,
pseudorabies viruses and retroviruses, and vectors derived from combinations thereof,
such as cosmids and phagemids.

The DNA insert should be operatively linked to an appropriate promoter,
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the
SV40 early and late promoters and promoters of retroviral LTRs, to name a few.
Other suitable promoters will be known to the skilled artisan. The expression
constructs will further contain sites for transcription initiation, termination and, in the
transcribed region, a ribosome binding site for translation. The coding portion of the
mature transcripts expressed by the constructs will preferably include a translation
initiating site at the beginning and a termination codon (UAA, UGA or UAG)
appropriately positioned at the end of the polypeptide to be translated.
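
The requirement that the coding portion begin with a translation initiation site and end with a termination codon can be checked with a minimal Python sketch. The function name is hypothetical, and the use of AUG as the only accepted start codon is a simplifying assumption; bacteria also initiate at other codons.

```python
STOP_CODONS = {"UAA", "UGA", "UAG"}


def has_start_and_stop(coding_region: str, start_codon: str = "AUG") -> bool:
    """Check for a start codon first, a stop codon last, and no stop in between."""
    seq = coding_region.upper().replace("T", "U")  # accept DNA or RNA letters
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    internal_stop = any(codon in STOP_CODONS for codon in codons[1:-1])
    return codons[0] == start_codon and codons[-1] in STOP_CODONS and not internal_stop


print(has_start_and_stop("AUGGCUUAA"))     # True
print(has_start_and_stop("AUGUAAGCUUAA"))  # False: premature stop codon
```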

As indicated, the expression vectors will preferably include at least one
selectable marker. Such markers include dihydrofolate reductase or neomycin
resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin
resistance genes for culturing in E. coli and other bacteria. Representative examples of
appropriate hosts include, but are not limited to, bacterial cells, such as E. coli,
Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS
and Bowes melanoma cells; and plant cells. Appropriate culture media and
conditions for the above-described host cells are known in the art.

Vectors preferred for use in bacteria include pQE70, pQE60, pQE9 and
pQE10 available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors,
pNH8A, pNH16a, pNH18A, pNH46A available from Stratagene; pET series of
vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540,
pRIT5 available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO,
pSV2CAT, pOG44, pXT1 and pSG available from Stratagene; and pSVK3, pBPV,
pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily
apparent to the skilled artisan.

Known bacterial promoters suitable for use in the present invention
include the E. coli lacI and lacZ promoters, the T3, T5 and T7 promoters, the gpt
promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic
promoters include the CMV immediate early promoter, the HSV thymidine kinase
promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such
as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the
mouse metallothionein-I promoter.

Introduction of the construct into the host cell can be effected by calcium
phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated
transfection, electroporation, transduction, infection or other methods. Such methods
are described in many standard laboratory manuals (for example, Davis et al., Basic
Methods In Molecular Biology (1986)).

Transcription of DNA encoding the polypeptides of the present invention by
higher eukaryotes may be increased by inserting an enhancer sequence into the vector.
Enhancers are cis-acting elements of DNA, usually from about 10 to 300 nucleotides,
that act to increase transcriptional activity of a promoter in a given host cell-type.
Examples of enhancers include the SV40 enhancer, which is located on the late side of
the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter
enhancer, the polyoma enhancer on the late side of the replication origin, and
adenovirus enhancers.

For secretion of the translated polypeptide into the lumen of the endoplasmic
reticulum, into the periplasmic space or into the extracellular environment,
appropriate secretion signals may be incorporated into the expressed polypeptide, for
example, the amino acid sequence KDEL. The signals may be endogenous to the
polypeptide or they may be heterologous signals.

The polypeptide may be expressed in a modified form, such as a fusion
protein, and may include not only secretion signals, but also additional heterologous
functional regions. For instance, a region of additional amino acids, particularly
charged amino acids, may be added to the N-terminus of the polypeptide to improve
stability and persistence in the host cell, during purification, or during subsequent
handling and storage. Also, peptide moieties may be added to the polypeptide to
facilitate purification. Such regions may be removed prior to final preparation of the
polypeptide. The addition of peptide moieties to polypeptides to engender secretion
or excretion, to improve stability and to facilitate purification, among others, are
familiar and routine techniques in the art. A preferred fusion protein comprises a
heterologous region from immunoglobulin that is useful to solubilize proteins. For
example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins
comprising various portions of the constant region of immunoglobulin molecules together
with another human protein or part thereof. In many cases, the Fc part in a fusion
protein is thoroughly advantageous for use in therapy and diagnosis and thus results,
for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other
hand, for some uses it would be desirable to be able to delete the Fc part after the
fusion protein has been expressed, detected and purified in the advantageous manner
described. This is the case when the Fc portion proves to be a hindrance to use in
therapy and diagnosis, for example when the fusion protein is to be used as antigen for
immunizations. In drug discovery, for example, human proteins, such as the
hIL5-receptor, have been fused with Fc portions for the purpose of high-throughput
screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J.
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chem. 270
(16):9459-9471.

The E. faecalis polypeptides can be recovered and purified from recombinant
cell cultures by well-known methods including ammonium sulfate or ethanol
precipitation, acid extraction, anion or cation exchange chromatography,
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity
chromatography, hydroxylapatite chromatography, lectin chromatography and high
performance liquid chromatography ("HPLC").
Polypeptides of the present invention include naturally purified products, products of
chemical synthetic procedures, and products produced by recombinant techniques
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher
plant, insect and mammalian cells.

Polypeptides and Fragments

The invention further provides an isolated E. faecalis polypeptide having an
amino acid sequence in Table 1, or a peptide or polypeptide comprising a portion of
the above polypeptides.

Variant and Mutant Polypeptides

To improve or alter the characteristics of E. faecalis polypeptides of the
present invention, protein engineering may be employed. Recombinant DNA
technology known to those skilled in the art can be used to create novel mutant
proteins or muteins including single or multiple amino acid substitutions, deletions,
additions, or fusion proteins. Such modified polypeptides can show, e.g., enhanced
activity or increased stability. In addition, they may be purified in higher yields and
show better solubility than the corresponding natural polypeptide, at least under
certain purification and storage conditions.

N-Terminal and C-Terminal Deletion Mutants

It is known in the art that one or more amino acids may be deleted from the
N-terminus or C-terminus without substantial loss of biological function. For
instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF
proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid
residues were missing. Accordingly, the present invention provides polypeptides
having one or more residues deleted from the amino terminus of the amino acid
sequence of the E. faecalis polypeptides shown in Table 1, and polynucleotides
encoding such polypeptides.

Similarly, many examples of biologically functional C-terminal deletion
muteins are known. For instance, Interferon gamma shows up to ten times higher
activities by deleting 8-10 amino acid residues from the carboxy terminus of the
protein. See, e.g., Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the
present invention provides polypeptides having one or more residues deleted from the
carboxy terminus of the amino acid sequence of the E. faecalis polypeptides shown in
Table 1. The invention also provides polypeptides having one or more amino acids
deleted from both the amino and the carboxyl termini as described below.
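
A terminal deletion mutant can be modeled as a simple slice of the amino acid sequence, as in the Python sketch below. The function name and the ten-residue example peptide are hypothetical; the sketch says nothing about whether any particular deletion retains activity.

```python
def terminal_deletion(peptide: str, n_deleted: int = 0, c_deleted: int = 0) -> str:
    """Return the peptide with residues removed from the N- and/or C-terminus."""
    if n_deleted < 0 or c_deleted < 0:
        raise ValueError("deletion counts must be non-negative")
    if n_deleted + c_deleted >= len(peptide):
        raise ValueError("deletions would remove the entire peptide")
    return peptide[n_deleted:len(peptide) - c_deleted]


example = "MKTAYIAKQR"  # hypothetical 10-residue peptide
print(terminal_deletion(example, n_deleted=3))               # AYIAKQR
print(terminal_deletion(example, c_deleted=2))               # MKTAYIAK
print(terminal_deletion(example, n_deleted=1, c_deleted=1))  # KTAYIAKQ
```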

The present invention is further directed to polynucleotides encoding portions
or fragments of the amino acid sequences described herein as well as to portions or
fragments of the isolated amino acid sequences described herein. Fragments include
portions of the amino acid sequences of Table 1, at least 5 contiguous amino acids
in length, selected from any two integers, one of which represents an N-terminal
position and a second of which represents a C-terminal position. The initiation codon
of the polypeptides of the present invention is position 1. Every combination of an
N-terminal and C-terminal position that a fragment at least 5 contiguous amino acid
residues in length could occupy, on any given amino acid sequence of Table 1, is
included in the invention. "At least" means a fragment may be 5 contiguous amino acid
residues in length or any integer between 5 and the number of residues in a full
length amino acid sequence minus 1. Therefore, included in the invention are
contiguous fragments specified by any N-terminal and C-terminal positions of an amino
acid sequence set forth in Table 1 wherein the length of the contiguous fragment is any
integer between 5 and the number of residues in a full length sequence minus 1.

Further, the invention includes polypeptides comprising fragments specified
by size, in amino acid residues, rather than by N-terminal and C-terminal positions.
The invention includes any fragment size, in contiguous amino acid residues, selected
from integers between 5 and the number of residues in a full length sequence minus 1.
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid
residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino
acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100
amino acid residues, about 200 amino acid residues, about 300 amino acid residues,
and about 400 amino acid residues. The preferred sizes are, of course, meant to
exemplify, not limit, the present invention as all size fragments representing any
integer between 5 and the number of residues in a full length sequence minus 1 are
included in the invention. The present invention also provides for the exclusion of any
fragments specified by N-terminal and C-terminal positions or by size in amino acid
residues as described above. Any number of fragments specified by N-terminal and
C-terminal positions or by size in amino acid residues as described above may be
excluded.

The above fragments need not be active since they would be useful, for
example, in immunoassays, in epitope mapping, epitope tagging, to generate
antibodies to a particular portion of the protein, as vaccines, and as molecular weight 
markers. 

10 Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, 
it also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the E. faecalis polypeptide can be varied without significant effect on the
structure or function of the protein. If such differences in sequence are contemplated,
it should be remembered that there will be critical areas on the protein which
determine activity. 

Thus, the invention further includes variations of the E. faecalis polypeptides
which show substantial E. faecalis polypeptide activity or which include regions of E.
faecalis protein such as the protein portions discussed below. Such mutants include
deletions, insertions, inversions, repeats, and type substitutions selected according to
general rules known in the art so as to have little effect on activity. For example,
guidance concerning how to make phenotypically silent amino acid substitutions is
provided. There are two main approaches for studying the tolerance of an amino acid
sequence to change. See, Bowie, J. U. et al. (1990), Science 247:1306-1310. The first
method relies on the process of evolution, in which mutations are either accepted or
rejected by natural selection. The second approach uses genetic engineering to 
introduce amino acid changes at specific positions of a cloned gene and selections or 
screens to identify sequences that maintain functionality. 

These studies have revealed that proteins are surprisingly tolerant of amino
acid substitutions. The studies indicate which amino acid changes are likely to be
permissive at a certain position of the protein. For example, most buried amino acid
residues require nonpolar side chains, whereas few features of surface side chains are
generally conserved. Other such phenotypically silent substitutions are described by
Bowie et al. (supra) and the references cited therein. Typically seen as conservative
substitutions are the replacements, one for another, among the aliphatic amino acids
Ala, Val, Leu and Ile; interchange of the hydroxyl residues Ser and Thr; exchange of
the acidic residues Asp and Glu; substitution between the amide residues Asn and
Gln; exchange of the basic residues Lys and Arg; and replacements among the aromatic
residues Phe and Tyr.
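
The conservative substitution groups named above can be written down as a small
lookup, sketched here in Python. The names `CONSERVATIVE_GROUPS` and
`is_conservative` are hypothetical, the standard three-letter residue codes are
used, and the sketch covers only the six groups listed in this paragraph; it is not
a substitute for Table 3.

```python
# Groups of residues that the text treats as interchangeable.
CONSERVATIVE_GROUPS = [
    {"Ala", "Val", "Leu", "Ile"},  # aliphatic
    {"Ser", "Thr"},                # hydroxyl
    {"Asp", "Glu"},                # acidic
    {"Asn", "Gln"},                # amide
    {"Lys", "Arg"},                # basic
    {"Phe", "Tyr"},                # aromatic
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if swapping original for replacement stays within one group.

    An unchanged residue is not counted as a substitution.
    """
    if original == replacement:
        return False
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)
```

For instance, `is_conservative("Leu", "Ile")` evaluates to `True`, while
`is_conservative("Leu", "Asp")` evaluates to `False`.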

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 
1, or that encoded by the plasmids listed in Table 1, may be: (i) one in which one or
more of the amino acid residues are substituted with a conserved or non-conserved
amino acid residue (preferably a conserved amino acid residue) and such substituted
amino acid residue may or may not be one encoded by the genetic code; or (ii) one in
which one or more of the amino acid residues includes a substituent group; or (iii) one
in which the E. faecalis polypeptide is fused with another compound, such as a
compound to increase the half-life of the polypeptide (for example, polyethylene
glycol); or (iv) one in which the additional amino acids are fused to the above form of
the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory
sequence or a sequence which is employed for purification of the above form of the
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the E. faecalis polypeptides of the present invention may include one or
more amino acid substitutions, deletions, or additions, either from natural mutations or
human manipulation. As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding or 
activity of the protein (see Table 3). 

Amino acids in the E. faecalis proteins of the present invention that are
essential for function can be identified by methods known in the art, such as
site-directed mutagenesis or alanine-scanning mutagenesis. See, e.g., Cunningham et al.
(1989) Science 244:1081-1085. The latter procedure introduces single alanine
mutations at every residue in the molecule. The resulting mutant molecules are then
tested for biological activity using assays appropriate for measuring the function of
the particular protein. 
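
The set of mutants produced by alanine scanning can be enumerated with a short
Python sketch. The function name `alanine_scan`, the one-letter residue codes, the
mutation labels, and the choice to skip positions that are already alanine are all
assumptions of this illustration; the activity assay itself is outside its scope.

```python
from typing import List, Tuple


def alanine_scan(seq: str) -> List[Tuple[str, str]]:
    """Return (label, mutant_sequence) pairs, one single-alanine mutant per residue.

    Labels use the form <original><1-based position>A, e.g. "K2A".
    Positions that are already alanine are skipped (an assumption of this
    sketch), since replacing A with A would give the unmodified sequence.
    """
    mutants = []
    for index, residue in enumerate(seq):
        if residue == "A":
            continue
        label = f"{residue}{index + 1}A"
        mutants.append((label, seq[:index] + "A" + seq[index + 1:]))
    return mutants
```

Each mutant sequence returned would then be expressed and tested in whatever assay
is appropriate for the function of the particular protein.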

Of special interest are substitutions of charged amino acids with other charged 
or neutral amino acids which may produce proteins with highly desirable improved 
characteristics, such as less aggregation. Aggregation may not only reduce activity but
also be problematic when preparing pharmaceutical formulations, because aggregates
can be immunogenic. See, e.g., Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340;
Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev.
Therapeutic Drug Carrier Systems 10:307-377.

The polypeptides of the present invention are preferably provided in an
isolated form, and preferably are substantially purified. A recombinantly produced
version of the E. faecalis polypeptide can be substantially purified by the one-step
method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the
invention also can be purified from natural or recombinant sources using antibodies
directed against the polypeptides of the invention in methods which are well known in
the art of protein purification.

The invention further provides for isolated E. faecalis polypeptides 
comprising an amino acid sequence selected from the group consisting of: (a) the 
amino acid sequence of a full-length E. faecalis polypeptide having the complete 
amino acid sequence shown in Table 1; (b) the amino acid sequence of a full-length E.
faecalis polypeptide having the complete amino acid sequence shown in Table 1
excepting the N-terminal methionine; (c) the complete amino acid sequence encoded
by the plasmids listed in Table 1; and (d) the complete amino acid sequence excepting
the N-terminal methionine encoded by the plasmids listed in Table 1. The
polypeptides of the present invention also include polypeptides having an amino acid 



WO 98/50554 






PCT/US98/08959 



sequence at least 80% identical, more preferably at least 90% identical, and still more 
preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a), (b), (c), 
and (d) above. 

Further polypeptides of the present invention include polypeptides which
have at least 90% similarity, more preferably at least 95% similarity, and still more
preferably at least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of an E. faecalis polypeptide having an amino acid
sequence which contains at least one conservative amino acid substitution, but not
more than 50 conservative amino acid substitutions, not more than 40 conservative
amino acid substitutions, not more than 30 conservative amino acid substitutions, or
not more than 20 conservative amino acid substitutions. Also provided are
polypeptides which comprise the amino acid sequence of an E. faecalis polypeptide,
having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino
acid substitutions.

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid
alterations per each 100 amino acids of the query amino acid sequence. In other
words, to obtain a polypeptide having an amino acid sequence at least 95% identical
to a query amino acid sequence, up to 5% of the amino acid residues in the subject
sequence may be inserted or deleted (indels) or substituted with another amino acid.
These alterations of the reference sequence may occur at the amino or carboxy
terminal positions of the reference amino acid sequence or anywhere between those
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences
shown in Table 1 or to the amino acid sequence encoded by the plasmids listed in Table
1 can be determined conventionally using known computer programs. A preferred
method for determining the best overall match between a query sequence (a sequence
of the present invention) and a subject sequence, also referred to as a global sequence
alignment, can be determined using the FASTDB computer program based on the
algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence
alignment the query and subject sequences are both amino acid sequences. The result
of said global sequence alignment is in percent identity. Preferred parameters used in a
FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch
Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1,
Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window
Size=500 or the length of the subject amino acid sequence, whichever is shorter.

If the subject sequence is shorter than the query sequence due to N- or
C-terminal deletions, not because of internal deletions, the results, in percent identity,
must be manually corrected. This is because the FASTDB program does not account
for N- and C-terminal truncations of the subject sequence when calculating global
percent identity. For subject sequences truncated at the N- and C-termini, relative to
the query sequence, the percent identity is corrected by calculating the number of
residues of the query sequence that are N- and C-terminal of the subject sequence,
which are not matched/aligned with a corresponding subject residue, as a percent of
the total residues of the query sequence. Whether a residue is matched/aligned is
determined by results of the FASTDB sequence alignment. This percentage is then
subtracted from the percent identity, calculated by the above FASTDB program using
the specified parameters, to arrive at a final percent identity score. This final percent
identity score is what is used for the purposes of the present invention. Only
residues to the N- and C-termini of the subject sequence, which are not
matched/aligned with the query sequence, are considered for the purposes of manually
adjusting the percent identity score. That is, only query amino acid residues outside
the farthest N- and C-terminal residues of the subject sequence are considered.

For example, a 90 amino acid residue subject sequence is aligned with a 100
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
match/align with the first 10 residues at the N-terminus. The 10 unpaired residues 
represent 10% of the sequence (number of residues at the N- and C-termini not
matched/total number of residues in the query sequence) so 10% is subtracted from
the percent identity score calculated by the FASTDB program. If the remaining 90
residues were perfectly matched the final percent identity would be 90%. In another
example, a 90 residue subject sequence is compared with a 100 residue query
sequence. This time the deletions are internal so there are no residues at the N- or
C-termini of the subject sequence which are not matched/aligned with the query. In this
case the percent identity calculated by FASTDB is not manually corrected. Once
again, only residue positions outside the N- and C-terminal ends of the subject
sequence, as displayed in the FASTDB alignment, which are not matched/aligned
with the query sequence are manually corrected. No other manual corrections are to
be made for the purposes of the present invention.
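
The manual correction described above reduces to one subtraction, sketched here in
Python. The function name `corrected_percent_identity` and its parameter names are
hypothetical; the sketch does not run FASTDB and assumes that the reported percent
identity and the counts of unmatched terminal query residues have already been read
from a FASTDB alignment.

```python
def corrected_percent_identity(reported_identity: float,
                               query_length: int,
                               unmatched_n_terminal: int,
                               unmatched_c_terminal: int) -> float:
    """Apply the manual terminal-truncation correction to a percent identity.

    reported_identity    -- percent identity as calculated by the alignment program
    query_length         -- total number of residues in the query sequence
    unmatched_n_terminal -- query residues N-terminal of the subject, not matched/aligned
    unmatched_c_terminal -- query residues C-terminal of the subject, not matched/aligned
    """
    if query_length <= 0:
        raise ValueError("query_length must be positive")
    unmatched = unmatched_n_terminal + unmatched_c_terminal
    if unmatched < 0 or unmatched > query_length:
        raise ValueError("unmatched terminal residues must lie between 0 and query_length")
    # Unmatched terminal residues as a percent of the total query residues.
    penalty = 100.0 * unmatched / query_length
    return reported_identity - penalty


# First worked example: 10 unmatched N-terminal residues out of 100, remainder perfect.
n_terminal_case = corrected_percent_identity(100.0, 100, 10, 0)

# Second worked example: deletions are internal, so nothing is subtracted.
internal_case = corrected_percent_identity(90.0, 100, 0, 0)
```

With these inputs the first call gives 90.0, matching the first example in the text,
and the second call returns the reported value unchanged. The value 90.0 passed to
the second call is a placeholder, since the text does not state what FASTDB reports
in that case.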

The above polypeptide sequences are included irrespective of whether they 
have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would 

still know how to use the polypeptide, for instance, as a vaccine or to generate
antibodies. Other uses of the polypeptides of the present invention that do not have
E. faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods known to those of skill in the art. 

As described below, the polypeptides of the present invention can also be
used to raise polyclonal and monoclonal antibodies, which are useful in assays for
detecting E. faecalis protein expression or as agonists and antagonists capable of
enhancing or inhibiting E. faecalis protein function. Further, such polypeptides can be
used in the yeast two-hybrid system to "capture" E. faecalis protein binding proteins
which are also candidate agonists and antagonists according to the present invention.
See, e.g., Fields et al. (1989) Nature 340:245-246. 

Epitope-Bearing Portions

In another aspect, the invention provides peptides and polypeptides
comprising epitope-bearing portions of the E. faecalis polypeptides of the present
invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides 
of the present invention. An "immunogenic epitope" is defined as a part of a protein 
that elicits an antibody response when the whole protein or polypeptide is the
immunogen. These immunogenic epitopes are believed to be confined to a few loci on
the molecule. On the other hand, a region of a protein molecule to which an antibody
can bind is defined as an "antigenic determinant" or "antigenic epitope." The number
of immunogenic epitopes of a protein generally is less than the number of antigenic
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998-4002.
Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table
4 only lists amino acid residues comprising epitopes predicted to have the highest
degree of antigenicity. The polypeptides not listed in Table 4 and portions of
polypeptides not listed in Table 4 are not considered non-antigenic. This is because
they may still be antigenic in vivo but merely not recognized as such by the particular
algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred
antigenic epitopes but is not a complete list. Amino acid residues comprising other
antigenic epitopes may be determined by algorithms similar to the Jameson-Wolf
analysis or by in vivo testing for an antigenic response using the methods described 
herein or those known in the art. 

As to the selection of peptides or polypeptides bearing an antigenic epitope
(i.e., that contain a region of a protein molecule to which an antibody can bind), it is
well known in the art that relatively short synthetic peptides that mimic part of a
protein sequence are routinely capable of eliciting an antiserum that reacts with the
partially mimicked protein. See, e.g., Sutcliffe, et al., (1983) Science 219:660-666.
Peptides capable of eliciting protein-reactive sera are frequently represented in the
primary sequence of a protein, can be characterized by a set of simple chemical rules,
and are confined neither to immunodominant regions of intact proteins (i.e.,
immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are
extremely hydrophobic and those of six or fewer residues generally are ineffective at
inducing antibodies that bind to the mimicked protein; longer peptides, especially
those containing proline residues, usually are effective. See, Sutcliffe, et al., supra, p.
661. For instance, 18 of 20 peptides designed according to these guidelines, containing
8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1
polypeptide chain, induced antibodies that reacted with the HA1 protein or intact
virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind
specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas
obtained by fusion of spleen cells from donors immunized with an antigenic
epitope-bearing peptide generally secrete antibody reactive with the native protein.
See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic epitope-bearing
peptides or polypeptides are useful to detect the mimicked protein, and antibodies to
different peptides may be used for tracking the fate of various regions of a protein
precursor which undergoes post-translational processing. The peptides and
anti-peptide antibodies may be used in a variety of qualitative or quantitative assays
for the mimicked protein, for instance in competition assays since it has been shown
that even short peptides (e.g., about 9 amino acids) can bind and displace the larger
peptides in immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell
37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography 
using methods known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention
designed according to the above guidelines preferably contain a sequence of at least
seven, more preferably at least nine and most preferably between about 10 to about
50 amino acids (i.e. any integer between 7 and 50) contained within the amino acid
sequence of a polypeptide of the invention. However, peptides or polypeptides
comprising a larger portion of an amino acid sequence of a polypeptide of the
invention, containing about 50 to about 100 amino acids, or any length up to and
including the entire amino acid sequence of a polypeptide of the invention, also are
considered epitope-bearing peptides or polypeptides of the invention and also are
useful for inducing antibodies that react with the mimicked protein. Preferably, the
amino acid sequence of the epitope-bearing peptide is selected to provide substantial
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used
to generate an enterococcal-specific immune response or antibodies include portions of
the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a
list of non-limiting residues that are involved in the antigenicity of the epitope-bearing
fragments of the present invention. Therefore, the present invention provides for
isolated and purified antigenic epitope-bearing fragments of the polypeptides of the
present invention comprising a peptide sequence of Table 4. The antigenic
epitope-bearing fragments comprising a peptide sequence of Table 4 preferably contain a
sequence of at least seven, more preferably at least nine and most preferably between
about 10 to about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide
of the present invention. That is, included in the present invention are antigenic
polypeptides between the integers of 7 and 50 amino acids in length comprising one or
more of the sequences of Table 4. Therefore, in most cases, the polypeptides of
Table 4 make up only a portion of the antigenic polypeptide. All combinations of
sequences between the integers of 7 and 50 amino acids in length comprising one or
more of the sequences of Table 4 are included. The antigenic epitope-bearing
fragments may be specified by either the number of contiguous amino acid residues
or by specific N-terminal and C-terminal positions as described above for the
polypeptide fragments of the present invention, wherein the initiation codon is
residue 1. Any number of the described antigenic epitope-bearing fragments of the
present invention may also be excluded from the present invention in the same
manner. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides including 
recombinant means using nucleic acid molecules of the invention. For instance, an
epitope-bearing amino acid sequence of the present invention may be fused to a larger
polypeptide which acts as a carrier during recombinant production and purification, as
well as during immunization to produce anti-peptide antibodies. Epitope-bearing
peptides also may be synthesized using known methods of chemical synthesis. For
instance, Houghten has described a simple method for synthesis of large numbers of
peptides, such as 10-20 mg of 248 different 13 residue peptides representing single
amino acid variants of a segment of the HA1 polypeptide which were prepared and
characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R.
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple
Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,211
to Houghten and coworkers (1986). In this procedure the individual resins for the
solid-phase synthesis of various peptides are contained in separate solvent-permeable
packets, enabling the optimal use of the many identical repetitive steps involved in
solid-phase methods. A completely manual procedure allows 500-1000 or more
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad.
Sci. 82:5131-5135 at 5134).

Epitope-bearing peptides and polypeptides of the invention are used to induce 
antibodies according to methods well known in the art. See, e.g., Sutcliffe, et al.,
supra; Wilson, et al., supra; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354.
Generally, animals may be immunized with free peptide; however, anti-peptide
antibody titer may be boosted by coupling of the peptide to a macromolecular carrier,
such as keyhole limpet hemocyanin (KLH) or tetanus toxoid. For instance, peptides
containing cysteine may be coupled to carrier using a linker such as
m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may
be coupled to carrier using a more general linking agent such as glutaraldehyde.
Animals such as rabbits, rats and mice are immunized with either free or
carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection
of emulsions containing about 100 µg peptide or carrier protein and Freund's adjuvant.
Several booster injections may be needed, for instance, at intervals of about two
weeks, to provide a useful titer of anti-peptide antibody which can be detected, for
example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of
anti-peptide antibodies in serum from an immunized animal may be increased by
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a
solid support and elution of the selected antibodies according to methods well known
in the art.

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a 
protein that elicit an antibody response when the whole protein is the immunogen, are 
identified according to methods known in the art. For instance, Geysen, et al., supra,
discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of
peptides of sufficient purity to react in an ELISA. Interaction of synthesized
peptides with antibodies is then easily detected without removing them from the
support. In this manner a peptide bearing an immunogenic epitope of a desired
protein may be identified routinely by one of ordinary skill in the art. For instance,
the immunologically important epitope in the coat protein of foot-and-mouth disease
virus was located by Geysen et al., supra, with a resolution of seven amino acids by
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire
213 amino acid sequence of the protein. Then, a complete replacement set of peptides
in which all 20 amino acids were substituted in turn at every position within the
epitope was synthesized, and the particular amino acids conferring specificity for the
reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Geysen (1987) further describes this method of identifying a peptide 
bearing an immunogenic epitope of a desired protein. 
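
The arithmetic behind the overlapping hexapeptide set and the replacement set can
be checked with a brief Python sketch. The function names `overlapping_peptides` and
`replacement_set`, the one-letter residue alphabet, and the placeholder sequences
are assumptions of this illustration. The replacement set here omits the unchanged
residue at each position, so it holds 19 variants per position rather than 20.

```python
from typing import List

# One-letter codes for the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def overlapping_peptides(seq: str, length: int = 6) -> List[str]:
    """Return every overlapping peptide of the given length, offset by one residue."""
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


def replacement_set(epitope: str) -> List[str]:
    """Return all single-position variants of an epitope.

    At each position the original residue is replaced in turn by each of the
    other 19 standard amino acids; the unchanged peptide is not included.
    """
    variants = []
    for position, original in enumerate(epitope):
        for residue in AMINO_ACIDS:
            if residue != original:
                variants.append(epitope[:position] + residue + epitope[position + 1:])
    return variants


# Placeholder 213-residue sequence; only its length matters for the count.
placeholder_protein = "A" * 213
hexapeptides = overlapping_peptides(placeholder_protein)

# Hypothetical hexapeptide used to show the size of a replacement set.
variants = replacement_set("MKTAYI")
```

A 213-residue sequence yields 213 - 6 + 1 = 208 overlapping hexapeptides, which
agrees with the figure quoted above, and a hexapeptide yields 6 x 19 = 114
single-position variants under the convention used in this sketch.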

Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general
method of detecting or determining the sequence of monomers (amino acids or other
compounds) which is a topological equivalent of the epitope (i.e., a "mimotope")
which is complementary to a particular paratope (antigen binding site) of an antibody
of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989),
describes a method of detecting or determining a sequence of monomers which is a
topographical equivalent of a ligand which is complementary to the ligand binding site
of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten,
R. A. et al. (1996) discloses linear C1-C7-alkyl peralkylated oligopeptides and sets and
libraries of such peptides, as well as methods for using such oligopeptide sets and
libraries for determining the sequence of a peralkylated oligopeptide that preferentially
binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitope-bearing peptides of the invention also can be made routinely by these 
methods. The entire disclosure of each document cited in this section on 
"Polypeptides and Fragments" is hereby incorporated herein by reference. 

As one of skill in the art will appreciate, the polypeptides of the present
invention and the epitope-bearing fragments thereof described above can be combined
with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric
polypeptides. These fusion proteins facilitate purification and show an increased
half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the
first two domains of the human CD4-polypeptide and various domains of the
constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA
0,394,827; Traunecker et al. (1988) Nature 331:84-86.) Fusion proteins that have a
disulfide-linked dimeric structure due to the IgG part can also be more efficient in
binding and neutralizing other molecules than a monomeric E. faecalis polypeptide or
fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964.
Nucleic acids encoding the above epitopes of E. faecalis polypeptides can also be
recombined with a gene of interest as an epitope tag to aid in detection and 
purification of the expressed polypeptide. 


Antibodies 

E. faecalis protein-specific antibodies for use in the present invention can be 
raised against the intact E. faecalis protein or an antigenic polypeptide fragment 
thereof, which may be presented together with a carrier protein, such as an albumin, to
an animal system (such as rabbit or mouse) or, if it is long enough (at least about 25
amino acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, single chain whole antibodies, and antibody 
fragments. Antibody fragments of the present invention include Fab and F(ab')2 and
other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv).
Also included in the present invention are chimeric and humanized monoclonal
antibodies and polyclonal antibodies specific for the polypeptides of the present
invention. The antibodies of the present invention may be prepared by any of a
variety of methods. For example, cells expressing a polypeptide of the present
invention or an antigenic fragment thereof can be administered to an animal in order to
induce the production of sera containing polyclonal antibodies. For example, a
preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to
render it substantially free of natural contaminants. Such a preparation is then
introduced into an animal in order to produce polyclonal antisera of greater specific
activity.

In a preferred method, the antibodies of the present invention are monoclonal 
antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988);
Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL
HYBRIDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and F(ab')2 fragments may be
produced by proteolytic cleavage, using enzymes such as papain (to produce Fab
fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E. faecalis
polypeptide-binding fragments, chimeric, and humanized antibodies can be produced
through the application of recombinant DNA technology or through synthetic 
chemistry using methods known in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide 
antigen of the present invention may be produced in a two-step procedure through the
use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies
are themselves antigens, and that, therefore, it is possible to obtain an antibody which
binds to a second antibody. In accordance with this method, E. faecalis
polypeptide-specific antibodies are used to immunize an animal, preferably a mouse.
The splenocytes of such an animal are then used to produce hybridoma cells, and the
hybridoma cells are screened to identify clones which produce an antibody whose
ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the
E. faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to
the E. faecalis polypeptide-specific antibody and can be used to immunize an animal
to induce formation of further E. faecalis polypeptide-specific antibodies.

Antibodies and fragments thereof of the present invention may be described
by the portion of a polypeptide of the present invention recognized or specifically
bound by the antibody. Antibody binding fragments of a polypeptide of the present
invention may be described or specified in the same manner as for polypeptide
fragments discussed above, i.e., by N-terminal and C-terminal positions or by size in
contiguous amino acid residues. Any number of antibody binding fragments, of a
polypeptide of the present invention, specified by N-terminal and C-terminal
positions or by size in amino acid residues, as described above, may also be excluded
from the present invention. Therefore, the present invention includes antibodies that
specifically bind a particularly described fragment of a polypeptide of the present
invention and allows for the exclusion of the same.

Antibodies and fragments thereof of the present invention may also be
described or specified in terms of their cross-reactivity. Antibodies and fragments
that do not bind polypeptides of any species of Enterococcus other than E.
faecalis are included in the present invention. Likewise, antibodies and fragments
that bind only species of Enterococcus, i.e., antibodies and fragments that do not bind
bacteria from any genus other than Enterococcus, are included in the present
invention.
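
The two cross-reactivity categories described above can be illustrated with a
short sketch. It is only an illustration under stated assumptions: the function
name, the category strings, and the idea of summarizing a binding panel as
(genus, species) pairs are hypothetical and are not taken from the
specification.

```python
from typing import Iterable, Tuple

# An organism is written as a (genus, species) pair. This representation is
# an assumption made for the sketch.
Organism = Tuple[str, str]


def classify_cross_reactivity(bound: Iterable[Organism]) -> str:
    """Classify an antibody from the set of organisms it was observed to bind."""
    bound_set = set(bound)
    if not bound_set:
        return "no binding observed"
    genera = {genus for genus, _ in bound_set}
    if genera != {"Enterococcus"}:
        # Binds bacteria from at least one genus other than Enterococcus.
        return "cross-reacts outside Enterococcus"
    species = {name for _, name in bound_set}
    if species == {"faecalis"}:
        # Binds no Enterococcus species other than E. faecalis.
        return "E. faecalis-specific"
    # Binds only Enterococcus, but not only E. faecalis.
    return "Enterococcus genus-specific"
```

For instance, a hypothetical panel in which an antibody binds both E. faecalis
and E. faecium, and nothing outside the genus, would fall in the second
category.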

Diagnostic Assays

The present invention further relates to methods for assaying Enterococcal
infection in an animal by detecting the expression of genes encoding Enterococcal
polypeptides of the present invention. The methods comprise analyzing tissue or
body fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or
proteins. Nucleic acid specific to Enterococcus is assayed by PCR or
hybridization techniques using nucleic acid sequences of the present invention as
either hybridization probes or primers. See, e.g., Sambrook et al., Molecular Cloning:
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing
differentiation among spotted fever group Rickettsiae species by analysis of restriction
fragment length polymorphism of PCR-amplified DNA); and Chen et al. (1994) J. Clin.
Microbiol. 32:589-595 (detecting B. burgdorferi nucleic acids via PCR).

Where diagnosis of a disease state related to infection with Enterococcus has
already been made, the present invention is useful for monitoring progression or
regression of the disease state whereby patients exhibiting enhanced Enterococcus
gene expression will experience a worse clinical outcome relative to patients expressing
these gene(s) at a lower level.

By "biological sample" is intended any biological sample obtained from an
animal, cell line, tissue culture, or other source which contains Enterococcus
polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva,
blood, plasma, urine, mucus, synovial fluid, etc.), tissues (such as muscle, skin, and
cartilage), and any other biological source suspected of containing Enterococcus
polypeptides or nucleic acids. Methods for obtaining biological samples such as
tissue are well known in the art.

The present invention is useful for detecting diseases related to Enterococcus
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds,
cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans.

Total RNA can be isolated from a biological sample using any suitable
technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method
described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding
Enterococcus polypeptides having sufficient homology to the nucleic acid sequences
identified in Table 1 to allow for hybridization between complementary sequences is
then assayed using any appropriate method. These include Northern blot analysis, S1
nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in
combination with the polymerase chain reaction (RT-PCR), and reverse transcription
in combination with the ligase chain reaction (RT-LCR).

Northern blot analysis can be performed as described in Harada et al. (1990)
Cell 63:303-312. Briefly, total RNA is prepared from a biological sample as described
above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been
linked to the filter by a UV linker, the filter is prehybridized in a solution containing
formamide, SSC, Denhardt's solution, denatured salmon sperm DNA, SDS, and sodium
phosphate buffer. An E. faecalis polynucleotide sequence shown in Table 1, labeled
according to any appropriate method (such as the 32P-multiprimed DNA labeling
system (Amersham)), is used as probe. After hybridization overnight, the filter is
washed and exposed to x-ray film. DNA for use as probe according to the present
invention is described in the sections above and will preferably be at least 15 nucleotides
in length.

S1 mapping can be performed as described in Fujita et al. (1987) Cell
49:357-367. To prepare probe DNA for use in S1 mapping, the sense strand of an
above-described E. faecalis DNA sequence of the present invention is used as a
template to synthesize labeled antisense DNA. The antisense DNA can then be
digested using an appropriate restriction endonuclease to generate further DNA
probes of a desired length. Such antisense probes are useful for visualizing protected
bands corresponding to the target mRNA (i.e., mRNA encoding Enterococcus
polypeptides).
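
The relationship between a sense-strand template and the antisense probe
synthesized from it can be shown with a minimal sketch. The function names and
the example sequence are hypothetical; the 15-nucleotide threshold reflects the
preferred minimum probe length stated above, and the code is an illustration
rather than part of any described protocol.

```python
# Preferred minimum probe length stated in the text (nucleotides).
MIN_PROBE_LENGTH = 15

# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_strand(sense: str) -> str:
    """Return the antisense (reverse-complement) strand, written 5' to 3'."""
    invalid = set(sense.upper()) - set("ACGT")
    if invalid:
        raise ValueError(f"unexpected characters in DNA sequence: {sorted(invalid)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return sense.translate(_COMPLEMENT)[::-1]


def meets_preferred_length(probe: str) -> bool:
    """Check a probe against the preferred minimum length of 15 nucleotides."""
    return len(probe) >= MIN_PROBE_LENGTH
```

A hypothetical 18-nucleotide sense sequence such as ATGAAAGCTTGGACCGTA would
yield an 18-nucleotide antisense probe that satisfies the preferred length.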

Levels of mRNA encoding Enterococcus polypeptides are assayed, e.g.,
using the RT-PCR method described in Makino et al. (1990) Technique 2:295-301.
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands
are linearly related to the initial concentration of the target mRNA. Briefly, this
method involves adding total RNA isolated from a biological sample to a reaction
mixture containing an RT primer and appropriate buffer. After incubating for primer
annealing, the mixture can be supplemented with an RT buffer, dNTPs, DTT, RNase
inhibitor and reverse transcriptase. After incubation to achieve reverse transcription
of the RNA, the RT products are then subjected to PCR using labeled primers.
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the
PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler
according to conventional techniques. After a suitable number of rounds to achieve
amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel.
After drying the gel, the radioactivity of the appropriate bands (corresponding to the
mRNA encoding the Enterococcus polypeptides of the present invention) is
quantified using an imaging analyzer. RT and PCR reaction ingredients and
conditions, reagent and gel concentrations, and labeling methods are well known in the
art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other
PCR methods that can detect the nucleic acid of the present invention can be found in
PCR PRIMER: A LABORATORY MANUAL (C.W. Dieffenbach et al. eds., Cold
Spring Harbor Lab Press, 1995).

The polynucleotides of the present invention, including both DNA and RNA,
may be used to detect polynucleotides of the present invention or Enterococcal
species including E. faecalis using bio chip technology. The present invention
includes both high density chip arrays (>1000 oligonucleotides per cm²) and low
density chip arrays (<1000 oligonucleotides per cm²). Bio chips comprising arrays of
polynucleotides of the present invention may be used to detect Enterococcal species,
including E. faecalis, in biological and environmental samples and to diagnose an
animal, including humans, with an E. faecalis or other Enterococcal infection. The bio
chips of the present invention may comprise polynucleotide sequences of other
pathogens including bacterial, viral, parasitic, and fungal polynucleotide sequences, in
addition to the polynucleotide sequences of the present invention, for use in rapid
differential pathogenic detection and diagnosis. The bio chips can also be used to
monitor an E. faecalis or other Enterococcal infection and to monitor the genetic
changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the
clinic and drug development in the laboratory. The bio chip technology comprising
arrays of polynucleotides of the present invention may also be used to simultaneously
monitor the expression of a multiplicity of genes, including those of the present
invention. The polynucleotides used to comprise a selected array may be specified in
the same manner as for the fragments, i.e., by their 5' and 3' positions or length in
contiguous base pairs and include from. Methods and particular uses of the
polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using bio chip technology include those known in the art and those of: U.S.
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681,
5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365,
WO/9511995, WO/9743447, WO/9535505, each incorporated herein in their
entireties.
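
The density threshold and the position-based specification of array
polynucleotides described above can be sketched as follows. This is a minimal
illustration: the function names are hypothetical, positions are assumed to be
1-based and inclusive, and a density of exactly 1000 oligonucleotides per cm²
is reported separately because the text assigns it to neither category.

```python
def classify_chip_density(num_oligos: int, area_cm2: float) -> str:
    """Classify an array as high or low density from its oligonucleotide count."""
    if num_oligos < 0 or area_cm2 <= 0:
        raise ValueError("count must be non-negative and area must be positive")
    density = num_oligos / area_cm2  # oligonucleotides per cm^2
    if density > 1000:
        return "high density"
    if density < 1000:
        return "low density"
    return "unclassified"  # exactly 1000 per cm^2 is not assigned by the text


def oligo_by_position(sequence: str, five_prime: int, three_prime: int) -> str:
    """Extract an array oligonucleotide by its 5' and 3' positions (1-based, inclusive)."""
    if not 1 <= five_prime <= three_prime <= len(sequence):
        raise ValueError("positions must satisfy 1 <= 5' <= 3' <= sequence length")
    return sequence[five_prime - 1:three_prime]
```

Under these assumptions, an array carrying 5000 oligonucleotides on 2 cm² has
a density of 2500 per cm² and is classed as high density.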

Biosensors using the polynucleotides of the present invention may also be
used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and
infections thereof. Biosensors using the polynucleotides of the present invention may
also be used to detect particular polynucleotides of the present invention. Biosensors
using the polynucleotides of the present invention may also be used to monitor the
genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in
the clinic and drug development in the laboratory. Methods and particular uses of the
polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using biosensors include those known in the art and those of: U.S. Patent Nos.
5721102, 5658732, 5631170, and World Patent Nos. WO97/35011, WO/9720203,
each incorporated herein in their entireties.

Thus, the present invention includes both bio chips and biosensors comprising
polynucleotides of the present invention and methods of their use.

Assaying Enterococcus polypeptide levels in a biological sample can occur
using any art-known method, such as antibody-based techniques. For example,
Enterococcus polypeptide expression in tissues can be studied with classical
immunohistological methods. In these, the specific recognition is provided by the
primary antibody (polyclonal or monoclonal) but the secondary detection system can
utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an
immunohistological staining of tissue section for pathological examination is obtained.
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of
Enterococcus polypeptides for Western-blot or dot/slot assay. See, e.g., Jalkanen, M.
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell. Biol.
105:3087-3096. In this technique, which is based on the use of cationic solid phases,
quantitation of an Enterococcus polypeptide can be accomplished using an isolated
Enterococcus polypeptide as a standard. This technique can also be applied to body
fluids.

Other antibody-based methods useful for detecting Enterococcus polypeptide
gene expression include immunoassays, such as the ELISA and the radioimmunoassay
(RIA). For example, Enterococcus polypeptide-specific monoclonal antibodies can
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and
quantify an Enterococcus polypeptide. The amount of an Enterococcus polypeptide
present in the sample can be calculated by reference to the amount present in a
standard preparation using a linear regression computer algorithm. Such an ELISA is
described in Iacobelli et al. (1988) Breast Cancer Research and Treatment 11:19-30. In
another ELISA assay, two distinct specific monoclonal antibodies can be used to
detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies
is used as the immunoabsorbent and the other as the enzyme-labeled probe.
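
The calculation of a sample amount by reference to a standard preparation
using linear regression can be sketched as below. This is an ordinary
least-squares illustration under stated assumptions, not the algorithm of the
cited reference: the function names are hypothetical, and the standard amounts
and signal readings shown in the usage note are invented solely for
demonstration.

```python
from typing import Sequence, Tuple


def fit_standard_curve(amounts: Sequence[float],
                       signals: Sequence[float]) -> Tuple[float, float]:
    """Fit signal = slope * amount + intercept by ordinary least squares."""
    if len(amounts) != len(signals) or len(amounts) < 2:
        raise ValueError("need at least two paired standard measurements")
    n = len(amounts)
    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    if sxx == 0:
        raise ValueError("standard amounts must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def estimate_amount(signal: float, slope: float, intercept: float) -> float:
    """Invert the fitted line to estimate the amount giving the observed signal."""
    if slope == 0:
        raise ValueError("a flat standard curve cannot be inverted")
    return (signal - intercept) / slope
```

With hypothetical standards of 0, 1, 2 and 4 units giving signals of 0.05,
0.25, 0.45 and 0.85, the fitted slope is about 0.2 and the intercept about
0.05, so a sample reading of 0.65 corresponds to roughly 3 units. In practice
the estimate is only meaningful within the range covered by the standards.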

The above techniques may be conducted essentially as a "one-step" or
"two-step" assay. The "one-step" assay involves contacting the Enterococcus
polypeptide with immobilized antibody and, without washing, contacting the mixture
with the labeled antibody. The "two-step" assay involves washing before contacting
the mixture with the labeled antibody. Other conventional methods may also be
employed as suitable. It is usually desirable to immobilize one component of the
assay system on a support, thereby allowing other components of the system to be
brought into contact with the component and readily removed from the sample.
Variations of the above and other immunological methods included in the present
invention can also be found in Harlow et al., ANTIBODIES: A LABORATORY
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988).

Suitable enzyme labels include, for example, those from the oxidase group,
which catalyze the production of hydrogen peroxide by reacting with substrate.
Glucose oxidase is particularly preferred as it has good stability and its substrate
(glucose) is readily available. Activity of an oxidase label may be assayed by
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled
antibody/substrate reaction. Besides enzymes, other suitable labels include
radioisotopes, such as iodine (¹²⁵I, ¹²¹I), carbon (¹⁴C), sulphur (³⁵S), tritium (³H),
indium (¹¹²In), and technetium (⁹⁹ᵐTc), and fluorescent labels, such as fluorescein and
rhodamine, and biotin.

Further suitable labels for the Enterococcus polypeptide-specific antibodies of
the present invention are provided below. Examples of suitable enzyme labels include
malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase,
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase,
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and
acetylcholine esterase.

Examples of suitable radioisotopic labels include ³H, ¹¹¹In, ¹²⁵I, ¹³¹I, ³²P, ³⁵S,
¹⁴C, ⁵¹Cr, ⁵⁷To, ⁵⁸Co, ⁵⁹Fe, ⁷⁵Se, ¹⁵²Eu, ⁹⁰Y, ⁶⁷Cu, ²¹⁷Ci, ²¹¹At, ²¹²Pb, ⁴⁷Sc, ¹⁰⁹Pd, etc.
¹¹¹In is a preferred isotope where in vivo imaging is used since it avoids the problem
of dehalogenation of the ¹²⁵I- or ¹³¹I-labeled monoclonal antibody by the liver. In
addition, this radionuclide has a more favorable gamma emission energy for imaging.
See, e.g., Perkins et al. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al.
(1987) J. Nucl. Med. 28:281-287. For example, ¹¹¹In coupled to monoclonal
antibodies with 1-(P-isothiocyanatobenzyl)-DPTA has shown little uptake in
non-tumorous tissues, particularly the liver, and therefore enhances specificity of tumor
localization. See Esteban et al. (1987) J. Nucl. Med. 28:861-870.

Examples of suitable non-radioactive isotopic labels include ¹⁵⁷Gd, ⁵⁵Mn,
¹⁶²Dy, ⁵²Tr, and ⁵⁶Fe.

Examples of suitable fluorescent labels include an ¹⁵²Eu label, a fluorescein
label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin
label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.

Examples of suitable toxin labels include Pseudomonas toxin, diphtheria toxin,
ricin, and cholera toxin.

Examples of chemiluminescent labels include a luminol label, an isoluminol
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an
oxalate ester label, a luciferin label, a luciferase label, and an aequorin label.

Examples of nuclear magnetic resonance contrasting agents include heavy metal 
nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are
provided by Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977)
Clin. Chim. Acta 81:1-40. Coupling techniques mentioned in the latter are the
glutaraldehyde method, the periodate method, the dimaleimide method, and the
m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are
incorporated by reference herein.

In a related aspect, the invention includes a diagnostic kit for use in screening
serum containing antibodies specific against E. faecalis infection. Such a kit may
include an isolated E. faecalis antigen comprising an epitope which is specifically
immunoreactive with at least one anti-E. faecalis antibody. Such a kit also includes
means for detecting the binding of said antibody to the antigen. In specific
embodiments, the kit may include a recombinantly produced or chemically
synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen
may be attached to a solid support.

In a more specific embodiment, the detecting means of the above-described kit
includes a solid support to which said peptide or polypeptide antigen is attached.
Such a kit may also include a non-attached reporter-labeled anti-human antibody. In
this embodiment, binding of the antibody to the E. faecalis antigen can be detected by
binding of the reporter-labeled antibody to the anti-E. faecalis polypeptide antibody.

In a related aspect, the invention includes a method of detecting E. faecalis
infection in a subject. This detection method includes reacting a body fluid, preferably
serum, from the subject with an isolated E. faecalis antigen, and examining the antigen
for the presence of bound antibody. In a specific embodiment, the method includes a
polypeptide antigen attached to a solid support, and serum is reacted with the
support. Subsequently, the support is reacted with a reporter-labeled anti-human
antibody. The support is then examined for the presence of reporter-labeled
antibody.

The solid surface reagent employed in the above assays and kits is prepared
by known techniques for attaching protein material to solid support material, such as
polymeric beads, dip sticks, 96-well plates or filter material. These attachment
methods generally include non-specific adsorption of the protein to the support or
covalent attachment of the protein, typically through a free amine group, to a
chemically reactive group on the solid support, such as an activated carboxyl,
hydroxyl, or aldehyde group. Alternatively, streptavidin-coated plates can be used in
conjunction with biotinylated antigen(s).

The polypeptides and antibodies of the present invention, including fragments
thereof, may be used to detect Enterococcal species including E. faecalis using bio chip
and biosensor technology. Bio chips and biosensors of the present invention may
comprise the polypeptides of the present invention to detect antibodies which
specifically recognize Enterococcal species, including E. faecalis. Bio chips and
biosensors of the present invention may also comprise antibodies which specifically
recognize the polypeptides of the present invention to detect Enterococcal species,
including E. faecalis, or specific polypeptides of the present invention. Bio chips or
biosensors comprising polypeptides or antibodies of the present invention may be
used to detect Enterococcal species, including E. faecalis, in biological and
environmental samples and to diagnose an animal, including humans, with an E.
faecalis or other Enterococcal infection. Thus, the present invention includes both bio
chips and biosensors comprising polypeptides or antibodies of the present invention
and methods of their use.

The bio chips of the present invention may further comprise polypeptide
sequences of other pathogens including bacterial, viral, parasitic, and fungal
polypeptide sequences, in addition to the polypeptide sequences of the present
invention, for use in rapid differential pathogenic detection and diagnosis. The bio
chips of the present invention may further comprise antibodies or fragments thereof
specific for other pathogens including bacterial, viral, parasitic, and fungal polypeptide
sequences, in addition to the antibodies or fragments thereof of the present invention,
for use in rapid differential pathogenic detection and diagnosis. The bio chips and
biosensors of the present invention may also be used to monitor an E. faecalis or other
Enterococcal infection and to monitor the genetic changes (amino acid deletions,
insertions, substitutions, etc.) in response to drug therapy in the clinic and drug
development in the laboratory. The bio chips and biosensors comprising polypeptides
or antibodies of the present invention may also be used to simultaneously monitor the
expression of a multiplicity of polypeptides, including those of the present invention.
The polypeptides used to comprise a bio chip or biosensor of the present invention
may be specified in the same manner as for the fragments, i.e., by their N-terminal and
C-terminal positions or length in contiguous amino acid residues. Methods and
particular uses of the polypeptides and antibodies of the present invention to detect
Enterococcal species, including E. faecalis, or specific polypeptides using bio chip and
biosensor technology include those known in the art, those of the U.S. Patent Nos.
and World Patent Nos. listed above for bio chips and biosensors using
polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732,
5135852, 5567301, 5677196, 5690894 and World Patent Nos. WO9729366,
WO9612957, each incorporated herein in their entireties.

Treatment:

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify
those which enhance or block the biological activity of the E. faecalis polypeptides of
the present invention. The present invention further provides such methods where the
compounds kill or slow the growth of E. faecalis. The ability of E. faecalis antagonists,
including E. faecalis ligands, to prophylactically or therapeutically block antibiotic
resistance may be easily tested by the skilled artisan. See, e.g., Straden et al. (1997)
J. Bacteriol. 179(1):9-16.

An agonist is a compound which increases the natural biological function or
which functions in a manner similar to the polypeptides of the present invention,
while antagonists decrease or eliminate such functions. Potential antagonists include
small organic molecules, peptides, polypeptides, and antibodies that bind to a
polypeptide of the invention and thereby inhibit or extinguish its activity.

The antagonists may be employed for instance to inhibit peptidoglycan cross
bridge formation. Antibodies against E. faecalis may be employed to bind to and
inhibit E. faecalis activity to treat antibiotic resistance. Any of the above antagonists
may be employed in a composition with a pharmaceutically acceptable carrier.

Vaccines

The present invention also provides vaccines comprising one or more
polypeptides of the present invention. Heterogeneity in the composition of a vaccine
may be provided by combining E. faecalis polypeptides of the present invention.
Multi-component vaccines of this type are desirable because they are likely to be
more effective in eliciting protective immune responses against multiple species and
strains of the Enterococcus genus than single polypeptide vaccines.

Multi-component vaccines are known in the art to elicit antibody production
to numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis.
174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent
vaccine has recently been demonstrated to elicit protective levels of antibodies in
human infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997)
Vaccine 15:7-9.

The present invention, in addition to single-component vaccines, includes
multi-component vaccines. These vaccines comprise more than one polypeptide,
immunogen or antigen. Thus, a multi-component vaccine would be a vaccine
comprising more than one of the E. faecalis polypeptides of the present invention.

Further within the scope of the invention are whole cell and whole viral
vaccines. Such vaccines may be produced recombinantly and involve the expression
of one or more of the E. faecalis polypeptides described in Table 1. For example, the
E. faecalis polypeptides of the present invention may be either secreted or localized
intracellularly, on the cell surface, or in the periplasmic space. Further, when a
recombinant virus is used, the E. faecalis polypeptides of the present invention may,
for example, be localized in the viral envelope, on the surface of the capsid, or
internally within the capsid. Whole cell vaccines which employ cells expressing
heterologous proteins are known in the art. See, e.g., Robinson, K. et al. (1997)
Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033;
Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412. These cells may be
administered live or may be killed prior to administration. Chabalgoity, J. et al., supra,
for example, report the successful use in mice of a live attenuated Salmonella vaccine
strain which expresses a portion of a platyhelminth fatty acid-binding protein as a
fusion protein on its cell surface.

A multi-component vaccine can also be prepared using techniques known in
the art by combining one or more E. faecalis polypeptides of the present invention, or
fragments thereof, with additional non-Enterococcal components (e.g., diphtheria
toxin or tetanus toxin, and/or other compounds known to elicit an immune response).
Such vaccines are useful for eliciting protective immune responses to both members of
the Enterococcus genus and non-Enterococcal pathogenic agents.

The vaccines of the present invention also include DNA vaccines. DNA
vaccines are currently being developed for a number of infectious diseases. See, e.g.,
Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine
14:1285-1288. Such DNA vaccines contain a nucleotide sequence encoding one or
more E. faecalis polypeptides of the present invention oriented in a manner that
allows for expression of the subject polypeptide. For example, the direct
administration of plasmid DNA encoding B. burgdorferi OspA has been shown to
elicit protective immunity in mice against borrelial challenge. See Luke et al. (1997) J.
Infect. Dis. 175:91-97.

The present invention also relates to the administration of a vaccine which is
co-administered with a molecule capable of modulating immune responses. Kim et al.
(1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune
responses produced by DNA immunizations when DNA sequences encoding
molecules which stimulate the immune response are co-administered. In a similar
fashion, the vaccines of the present invention may be co-administered with either
nucleic acids encoding immune modulators or the immune modulators themselves.



WO 98/50554 




PCT/US98/08959 



These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to
Enterococcal infection by either passive or active immunization. When the vaccines of
the present invention are used to confer resistance to Enterococcal infection through
active immunization, a vaccine of the present invention is administered to an animal to
elicit a protective immune response which either prevents or attenuates an Enterococcal
infection. When the vaccines of the present invention are used to confer resistance to
Enterococcal infection through passive immunization, the vaccine is provided to a host
animal (e.g., human, dog, or mouse), and the antisera elicited by this vaccine is
recovered and directly provided to a recipient suspected of having an infection caused
by a member of the Enterococcus genus.

The ability to label antibodies, or fragments of antibodies, with toxin molecules
provides an additional method for treating Enterococcal infections when passive
immunization is conducted. In this embodiment, antibodies, or fragments of
antibodies, capable of recognizing the E. faecalis polypeptides disclosed herein, or
fragments thereof, as well as other Enterococcus proteins, are labeled with toxin
molecules prior to their administration to the patient. When such toxin-derivatized
antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and
will cause their death.

The present invention thus concerns and provides a means for preventing or
attenuating an Enterococcal infection resulting from organisms which have antigens that
are recognized and bound by antisera produced in response to the polypeptides of the
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if
its administration to an animal results either in the total or partial attenuation (i.e.,
suppression) of a symptom or condition of the disease, or in the total or partial
immunity of the animal to the disease.

The administration of the vaccine (or the antisera which it elicits) may be for
either a "prophylactic" or "therapeutic" purpose. When provided prophylactically,
the compound(s) are provided in advance of any symptoms of Enterococcal infection.
The prophylactic administration of the compound(s) serves to prevent or attenuate
any subsequent infection. When provided therapeutically, the compound(s) are
provided upon or after the detection of symptoms which indicate that an animal may
be infected with a member of the Enterococcus genus. The therapeutic administration
of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis
polypeptides, and fragments thereof, of the present invention may be provided either
prior to the onset of infection (so as to prevent or attenuate an anticipated infection)
or after the initiation of an actual infection.

The polypeptides of the invention, whether encoding a portion of a native
protein or a functional derivative thereof, may be administered in pure form or may be
coupled to a macromolecular carrier. Examples of such carriers are proteins and
carbohydrates. Suitable proteins which may act as macromolecular carriers for
enhancing the immunogenicity of the polypeptides of the present invention include
keyhole limpet hemocyanin (KLH), tetanus toxoid, pertussis toxin, bovine serum
albumin, and ovalbumin. Methods for coupling the polypeptides of the present
invention to such macromolecular carriers are disclosed in Harlow et al.,
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory
Press, 2nd ed. 1988).

A composition is said to be "pharmacologically or physiologically acceptable"
if its administration can be tolerated by a recipient animal and is otherwise suitable for
administration to that animal. Such an agent is said to be administered in a
"therapeutically effective amount" if the amount administered is physiologically
significant. An agent is physiologically significant if its presence results in a
detectable change in the physiology of a recipient patient.

While in all instances the vaccine of the present invention is administered as a
pharmacologically acceptable compound, one skilled in the art would recognize that
the composition of a pharmacologically acceptable compound varies with the animal
to which it is administered. For example, a vaccine intended for human use will
generally not be co-administered with Freund's adjuvant. Further, the level of purity
of the E. faecalis polypeptides of the present invention will normally be higher when
administered to a human than when administered to a non-human animal.

As would be understood by one of ordinary skill in the art, when the vaccine
of the present invention is provided to an animal, it may be in a composition which
may contain salts, buffers, adjuvants, or other substances which are desirable for
improving the efficacy of the composition. Adjuvants are substances that can be used
to specifically augment a specific immune response. These substances generally
perform two functions: (1) they protect the antigen(s) from being rapidly catabolized
after administration and (2) they nonspecifically stimulate immune responses.

Normally, the adjuvant and the composition are mixed prior to presentation to
the immune system, or presented separately, but into the same site of the animal being
immunized. Adjuvants can be loosely divided into several groups based upon their
composition. These groups include oil adjuvants (for example, Freund's complete and
incomplete), mineral salts (for example, AlK(SO4)2, AlNa(SO4)2, AlNH4(SO4), silica,
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as
well as substances found in Corynebacterium parvum, or Bordetella pertussis, and
members of the genus Brucella). Other substances useful as adjuvants are the saponins
such as, for example, Quil A (Superfos A/S, Denmark). Preferred adjuvants for use in
the present invention include aluminum salts, such as AlK(SO4)2, AlNa(SO4)2, and
AlNH4(SO4). Examples of materials suitable for use in vaccine compositions are
provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A.
Osol, ed., Mack Publishing Co., Easton, PA, 1980) (incorporated herein by reference).

The therapeutic compositions of the present invention can be administered
parenterally by injection, rapid infusion, nasopharyngeal absorption
(intranasopharyngeally), dermoabsorption, or orally. The compositions may
alternatively be administered intramuscularly, or intravenously. Compositions for
parenteral administration include sterile aqueous or non-aqueous solutions,
suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol,
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such
as ethyl oleate. Carriers or occlusive dressings can be used to increase skin
permeability and enhance antigen absorption. Liquid dosage forms for oral
administration may generally comprise a liposome solution containing the liquid
dosage form. Suitable forms for suspending liposomes include emulsions,
suspensions, solutions, syrups, and elixirs containing inert diluents commonly used in
the art, such as purified water. Besides the inert diluents, such compositions can also
include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening,
flavoring, or perfuming agents.

Therapeutic compositions of the present invention can also be administered in
encapsulated form. For example, intranasal immunization can be performed using
vaccines encapsulated in biodegradable microspheres composed of
poly(DL-lactide-co-glycolide). See, Shahin, R. et al. (1995) Infect. Immun.
63:1195-1200. Similarly, orally administered encapsulated Salmonella typhimurium
antigens can also be used. Allaoui-Attarki, K. et al. (1997) Infect. Immun. 65:853-857.
Encapsulated vaccines of the present invention can be administered by a variety of
routes including those involving contacting the vaccine with mucous membranes (e.g.,
intranasally, intracolonically, intraduodenally).

Many different techniques exist for the timing of the immunizations when a
multiple administration regimen is utilized. It is possible to use the compositions of
the invention more than once to increase the levels and diversities of expression of the
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple
immunizations are given, they will be given one to two months apart.

According to the present invention, an "effective amount" of a therapeutic
composition is one which is sufficient to achieve a desired biological effect. Generally,
the dosage needed to provide an effective amount of the composition will vary
depending upon such factors as the animal's or human's age, condition, sex, and extent
of disease, if any, and other variables which can be adjusted by one of ordinary skill in
the art.

The antigenic preparations of the invention can be administered by either
single or multiple dosages of an effective amount. Effective amounts of the
compositions of the invention can vary from 0.01-1,000 µg/ml per dose, more
preferably 0.1-500 µg/ml per dose, and most preferably 10-300 µg/ml per dose.

Examples 

Example 1: Isolation of a Selected DNA Clone From the Deposited Sample of E.
faecalis

Three approaches can be used to isolate an E. faecalis clone comprising a
polynucleotide of the present invention from any E. faecalis genomic DNA library.
The E. faecalis strain V586 has been deposited as a convenient source for obtaining an
E. faecalis strain, although a wide variety of E. faecalis strains known in the art can
be used.

E. faecalis genomic DNA is prepared using the following method. A 20 ml
overnight bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain
Heart Infusion broth or Super broth) is pelleted, washed two times with TES (30 mM
Tris, pH 8.0, 25 mM EDTA, 50 mM NaCl), and resuspended in 5 ml high salt TES
(2.5 M NaCl). Lysostaphin is added to a final concentration of approximately 50 µg/ml
and the mixture is rotated slowly for 1 hour at 37°C to make protoplast cells. The
solution is then placed in an incubator (or in a shaking water bath) and warmed to
55°C. Five hundred microliters of 20% sarcosyl in TES (final concentration 2%) is then
added to lyse the cells. Next, guanidine HCl is added to a final concentration of 7 M
(3.69 g in 5.5 ml). The mixture is swirled slowly at 55°C for 60-90 min (solution should
clear). A CsCl gradient is then set up in SW41 ultra clear tubes using 2.0 ml 5.7 M CsCl
and overlaying with 2.85 M CsCl. The gradient is carefully overlaid with the
DNA-containing GuHCl solution. The gradient is spun at 30,000 rpm, 20°C for 24 hr and
the lower DNA band is collected. The volume is increased to 5 ml with TE buffer.
The DNA is then treated with protease K (10 µg/ml) overnight at 37°C, and
precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer.
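
The reagent amounts quoted in the lysis steps can be checked with simple arithmetic.
The following is a minimal sketch, not part of the disclosed method: it reads the stated
guanidine HCl volume as 5.5 ml, takes the molar mass of guanidine hydrochloride as
95.53 g/mol, and ignores the volume change on dissolving the solid. The function names
are illustrative only.

```python
# Molar mass of guanidine hydrochloride (CH5N3.HCl), g/mol.
GUHCL_MOLAR_MASS = 95.53


def mass_for_molarity(molarity_m, volume_ml, molar_mass):
    """Grams of solute for volume_ml of solution at molarity_m (mol/L)."""
    return molarity_m * (volume_ml / 1000.0) * molar_mass


def final_percent(stock_percent, stock_ml, sample_ml):
    """Final concentration (%) after adding stock_ml of stock to sample_ml."""
    return stock_percent * stock_ml / (stock_ml + sample_ml)


# 7 M guanidine HCl in a nominal 5.5 ml (5 ml suspension + 0.5 ml sarcosyl).
guhcl_g = mass_for_molarity(7.0, 5.5, GUHCL_MOLAR_MASS)

# 500 microliters of 20% sarcosyl added to the 5 ml cell suspension.
sarcosyl_pct = final_percent(20.0, 0.5, 5.0)

print(f"guanidine HCl: {guhcl_g:.2f} g; sarcosyl: {sarcosyl_pct:.1f}%")
```

Under these assumptions the computed values fall close to the quoted 3.69 g and the
nominal 2% final sarcosyl concentration.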

In the first method, a plasmid is directly isolated by screening a plasmid E.
faecalis genomic DNA library using a polynucleotide probe corresponding to a
polynucleotide of the present invention. Particularly, a specific polynucleotide with
30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer
according to the sequence reported. The oligonucleotide is labeled, for instance, with
32P-γ-ATP using T4 polynucleotide kinase and purified according to routine methods.
(See, e.g., Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor Press, Cold Spring Harbor, NY (1982).) The library is transformed into a
suitable host, as indicated above (such as XL-1 Blue (Stratagene)) using techniques
known to those of skill in the art. See, e.g., Sambrook et al. MOLECULAR CLONING: A
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al.,
CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (John Wiley and Sons,
N.Y. 1989). The transformants are plated on 1.5% agar plates (containing the
appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants
(colonies) per plate. These plates are screened using Nylon membranes according to
routine methods for bacterial colony screening. See, e.g., Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor,
N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR
BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of
skill in the art.

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends
of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA
by PCR using an E. faecalis genomic DNA prep as a template. PCR is carried out
under routine conditions, for instance, in 25 µl of reaction mixture with 0.5 µg of the
above DNA template. A convenient reaction mixture is 1.5-5 mM MgCl2, 0.01%
(w/v) gelatin, 20 µM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94°C for 1
min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a
Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by
agarose gel electrophoresis and the DNA band with expected molecular weight is
excised and purified. The PCR product is verified to be the selected sequence by
subcloning and sequencing the DNA product.
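
Deriving the two primers from the ends of a target sequence can be sketched in a few
lines. This is an illustrative sketch only: the target sequence below is a made-up
placeholder, not a sequence from Table 1, and the function names are hypothetical. The
forward primer is the first N nucleotides of the target, and the reverse primer is the
reverse complement of the last N nucleotides.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def end_primers(target, length=20):
    """Derive forward and reverse primers from the 5' and 3' ends of target."""
    if not 15 <= length <= 25:
        raise ValueError("the text calls for primers of 15-25 nucleotides")
    if len(target) < 2 * length:
        raise ValueError("target is too short for non-overlapping primers")
    target = target.upper()
    forward = target[:length]                       # matches the 5' end as written
    reverse = reverse_complement(target[-length:])  # anneals to the 3' end
    return forward, reverse


# Placeholder sequence for illustration; not taken from Table 1.
example_target = "ATGAAAGGTCTGACCGATTACGCAGGCTTAGCTGAAACCGGTTAA"
fwd, rev = end_primers(example_target, length=15)
print("forward:", fwd)
print("reverse:", rev)
```

Both primers are written 5' to 3', which is the orientation in which oligonucleotides
are normally ordered.
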

Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically
synthesized and used to generate a nucleotide sequence of desired length using PCR
methods known in the art.
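
One way to picture the overlapping-oligo approach is to tile the target sequence with
windows that share a fixed overlap and alternate strands, so that neighbouring oligos
can anneal and be extended. The sketch below is a hedged illustration under stated
assumptions: the oligo length (40) and overlap (20) are arbitrary example values, and
the text does not specify a particular assembly scheme.

```python
_COMP = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMP)[::-1]


def overlapping_oligos(seq, oligo_len=40, overlap=20):
    """Tile seq with alternating-strand oligos sharing `overlap` nucleotides.

    Returns a list of (strand, oligo) tuples, each oligo written 5' to 3'.
    """
    seq = seq.upper()
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be positive and shorter than the oligo")
    step = oligo_len - overlap
    oligos = []
    start = 0
    while True:
        window = seq[start:start + oligo_len]
        strand = "+" if len(oligos) % 2 == 0 else "-"
        oligos.append((strand, window if strand == "+" else revcomp(window)))
        if start + oligo_len >= len(seq):
            break
        start += step
    return oligos


def reassemble(oligos, overlap=20):
    """Rebuild the top-strand sequence from tiled oligos (sanity check)."""
    top = [s if strand == "+" else revcomp(s) for strand, s in oligos]
    assembled = top[0]
    for fragment in top[1:]:
        assembled += fragment[overlap:]
    return assembled


# Placeholder sequence for illustration; not taken from Table 1.
example_seq = ("ATGGCTAGCTTAGGCATCGATCGGATCCATGCAAGCTTGGCACTGGCCGTCG"
               "TTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCC")
tiles = overlapping_oligos(example_seq)
print(len(tiles), "oligos;", "round trip ok:", reassemble(tiles) == example_seq)
```

The round-trip check only confirms that the tiles cover the sequence; it does not model
annealing temperatures or secondary structure.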

Example 2(a): Expression and Purification of Enterococcal polypeptides in E. coli

The bacterial expression vector pQE60 was used for bacterial expression of
some of the polypeptide fragments used in the soft tissue and systemic infection
models discussed below (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA,
91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a
bacterial origin of replication ("ori"), an IPTG inducible promoter, a ribosome binding
site ("RBS"), six codons encoding histidine residues that allow affinity purification
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra)
and suitable single restriction enzyme cleavage sites. These elements are arranged such
that an inserted DNA fragment encoding a polypeptide expresses that polypeptide
with the six His residues (i.e., a "6 X His tag") covalently linked to the carboxyl
terminus of that polypeptide.

The DNA sequence encoding the desired portion of an E. faecalis protein of the
present invention was amplified from E. faecalis genomic DNA using PCR
oligonucleotide primers which anneal to the 5' and 3' sequences coding for the
portions of the E. faecalis polynucleotide shown in Table 1. Additional nucleotides
containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5'
and 3' sequences, respectively.

For cloning the mature protein, the 5' primer has a sequence containing an
appropriate restriction site followed by nucleotides of the amino terminal coding
sequence of the desired E. faecalis polynucleotide sequence in Table 1. One of
ordinary skill in the art would appreciate that the point in the protein coding sequence
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding
any desired portion of the complete protein shorter or longer than the mature form.
The 3' primer has a sequence containing an appropriate restriction site followed by
nucleotides complementary to the 3' end of the polypeptide coding sequence of Table
1, excluding a stop codon, with the coding sequence aligned with the restriction site so
as to maintain its reading frame with that of the six His codons in the pQE60 vector.
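
The primer layout described above (restriction site followed by coding sequence, with
the stop codon left out of the 3' primer) can be sketched as follows. This is a
hypothetical illustration: the coding sequence is a placeholder, the restriction sites
(GGATCC and AGATCT) and the two-base 5' clamp are example choices, and whether the
product is in frame with the His codons must be checked against the actual vector map,
which this sketch does not model.

```python
_COMP = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def revcomp(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMP)[::-1]


def strip_stop(cds):
    """Return the coding sequence without a terminal stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return cds[:-3] if cds[-3:] in STOP_CODONS else cds


def tagging_primers(cds, site5, site3, anneal=18, clamp="GC"):
    """Build 5' and 3' primers: clamp + restriction site + annealing region.

    The 3' primer anneals to the end of the coding sequence with the stop
    codon removed, so translation can continue into a downstream tag.
    """
    body = strip_stop(cds)
    forward = clamp + site5 + body[:anneal]
    reverse = clamp + site3 + revcomp(body[-anneal:])
    return forward, reverse


# Placeholder coding sequence and example sites; all are assumptions.
example_cds = "ATGAAAGGTCTGACCGATTACGCAGGCTTAGCTGAAACCGGTTAA"
fwd, rev = tagging_primers(example_cds, site5="GGATCC", site3="AGATCT")
print("5' primer:", fwd)
print("3' primer:", rev)
```

Because the stop codon is removed and the remaining insert length is a multiple of
three, the insert itself does not shift the reading frame; the contribution of the
restriction sites depends on the vector.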

The amplified E. faecalis DNA fragment and the vector pQE60 were digested
with restriction enzymes which recognize the sites in the primers and the digested
DNAs were then ligated together. The E. faecalis DNA was inserted into the
restricted pQE60 vector in a manner which places the E. faecalis protein coding region
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG
and the six histidine codons.

The ligation mixture was transformed into competent E. coli cells using
standard procedures such as those described by Sambrook et al., supra. E. coli strain
M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac
repressor and confers kanamycin resistance ("Kanr"), was used in carrying out the
illustrative example described herein. This strain, which was only one of many that
are suitable for expressing an E. faecalis polypeptide, is available commercially
(QIAGEN, Inc., supra). Transformants were identified by their ability to grow on LB
agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated
from resistant colonies and the identity of the cloned DNA confirmed by restriction
analysis, PCR and DNA sequencing.

Clones containing the desired constructs were grown overnight ("O/N") in
liquid culture in LB media supplemented with both ampicillin (100 µg/ml) and
kanamycin (25 µg/ml). The O/N culture was used to inoculate a large culture, at a
dilution of approximately 1:25 to 1:250. The cells were grown to an optical density at
600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-β-D-thiogalactopyranoside
("IPTG") was then added to a final concentration of 1 mM to induce transcription
from the lac repressor sensitive promoter, by inactivating the lacI repressor. Cells
subsequently were incubated further for 3 to 4 hours. Cells then were harvested by
centrifugation.
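
The inoculation and induction volumes follow from the dilution ratios and the
C1·V1 = C2·V2 relation. The sketch below is illustrative only: the 1 liter culture
volume and the 1 M IPTG stock concentration are assumptions, not values given in the
text, and the small volume added with the stock is ignored.

```python
def inoculum_ml(culture_ml, dilution):
    """Volume of overnight culture for a 1:dilution inoculation."""
    return culture_ml / dilution


def stock_volume_ml(final_mm, culture_ml, stock_mm):
    """Stock volume needed for final_mm in culture_ml (C1*V1 = C2*V2)."""
    return final_mm * culture_ml / stock_mm


culture_ml = 1000.0      # assumed large-culture volume
iptg_stock_mm = 1000.0   # assumed 1 M IPTG stock, expressed in mM

print("inoculum at 1:25 :", inoculum_ml(culture_ml, 25), "ml")
print("inoculum at 1:250:", inoculum_ml(culture_ml, 250), "ml")
print("IPTG stock for 1 mM:", stock_volume_ml(1.0, culture_ml, iptg_stock_mm), "ml")
```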

The cells were then stirred for 3-4 hours at 4°C in 6 M guanidine-HCl, pH 8.
The cell debris was removed by centrifugation, and the supernatant containing the E.
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA")
affinity resin column (QIAGEN, Inc., supra). Proteins with a 6 X His tag bind to the
Ni-NTA resin with high affinity and were purified in a simple one-step procedure (for
details see: The QIAexpressionist, 1995, QIAGEN, Inc., supra). Briefly, the
supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8, the column was
first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10
volumes of 6 M guanidine-HCl, pH 6, and finally the E. faecalis polypeptide was
eluted with 6 M guanidine-HCl, pH 5.

The purified protein was then renatured by dialyzing it against
phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM
NaCl. Alternatively, the protein could be successfully refolded while immobilized on
the Ni-NTA column. The recommended conditions are as follows: renature using a
linear 6 M-1 M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4,
containing protease inhibitors. The renaturation should be performed over a period of
1.5 hours or more. After renaturation the proteins can be eluted by the addition of
250 mM imidazole. Imidazole was removed by a final dialyzing step against PBS
or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein was
stored at 4°C or frozen at -80°C.

Some of the polypeptides of the present invention were prepared using a
non-denaturing protein purification method. For these polypeptides, the cell pellet from
each liter of culture was resuspended in 25 ml of Lysis Buffer A at 4°C (Lysis Buffer
A = 50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10%
Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail
(Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was
approximately 10-20 O.D./ml. The suspension was then put through three
freeze/thaw cycles from -70°C (using an ethanol-dry ice bath) up to room temperature.
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at
approximately 80 W while kept on ice. The sonicated sample was then centrifuged at
15,000 RPM for 30 minutes at 4°C. The supernatant was passed through a column
containing 1.0 ml of CL-4B resin to pre-clear the sample of any proteins that may
bind to agarose non-specifically, and the flow-through fraction was collected.

The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid
("Ni-NTA") affinity resin column (QIAGEN, Inc., supra). Proteins with a 6 X His tag
bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step
procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at
4°C, and the column was first washed with 10 volumes of Lysis Buffer A until the A280
of the eluate returned to the baseline. Then, the column was washed with 5 volumes of
40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) (Buffer B = 50 mM
Na-Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM
Imidazole, pH of the final buffer should be 7.5). The protein was eluted off of the
column with a series of increasing Imidazole solutions made by adjusting the ratios of
Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of
75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM
Imidazole. The fractions containing the purified protein were analyzed using 8%, 10%
or 14% SDS-PAGE depending on the protein size. The purified protein was then
dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily
workable buffer. The purified protein was stored at 4°C or frozen at -80°C.
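
The imidazole wash and elution solutions are blends of Lysis Buffer A (no imidazole)
and Buffer B (500 mM imidazole), so the mixing ratio for each step follows directly
from the target concentration. The sketch below reproduces the stated 92% / 8% blend
for 40 mM and applies the same arithmetic to the three elution steps; the function
name and the 100 ml batch size are illustrative assumptions.

```python
BUFFER_B_IMIDAZOLE_MM = 500.0  # from the text; Lysis Buffer A contains none


def blend_volumes(target_mm, total_ml=100.0):
    """Return (ml of Lysis Buffer A, ml of Buffer B) for a target imidazole level."""
    if not 0.0 <= target_mm <= BUFFER_B_IMIDAZOLE_MM:
        raise ValueError("target must lie between 0 and 500 mM imidazole")
    fraction_b = target_mm / BUFFER_B_IMIDAZOLE_MM
    return (1.0 - fraction_b) * total_ml, fraction_b * total_ml


for target in (40.0, 75.0, 150.0, 500.0):
    buffer_a_ml, buffer_b_ml = blend_volumes(target)
    print(f"{target:5.0f} mM: {buffer_a_ml:5.1f} ml A + {buffer_b_ml:5.1f} ml B")
```

Because the two buffers are otherwise identical in composition, blending changes only
the imidazole concentration.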

The following alternative method may be used to purify E. faecalis polypeptides
expressed in E. coli when they are present in the form of inclusion bodies. Unless
otherwise specified, all of the following steps are conducted at 4-10°C.

Upon completion of the production phase of the E. coli fermentation, the cell
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per
unit weight of cell paste and the amount of purified protein required, an appropriate
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension
using a high shear mixer.

The cells are then lysed by passing the solution through a microfluidizer
(Microfluidics Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the
pellet is discarded and the E. faecalis polypeptide-containing supernatant is incubated
at 4°C overnight to allow further GuHCl extraction.

Following high speed centrifugation (30,000 x g) to remove insoluble particles,
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C
without mixing for 12 hours prior to further purification steps.
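
Refolding by rapid dilution works by dropping the denaturant concentration. As a
rough check, and assuming the extract is still at the 1.5 M GuHCl used for
solubilization and that volumes are additive, mixing one volume of extract with 20
volumes of buffer gives the residual concentration computed below. The function name
is illustrative.

```python
def diluted_concentration(stock_m, sample_volumes=1.0, diluent_volumes=20.0):
    """Concentration after mixing sample with diluent, assuming additive volumes."""
    return stock_m * sample_volumes / (sample_volumes + diluent_volumes)


residual_guhcl_m = diluted_concentration(1.5, 1.0, 20.0)
print(f"residual GuHCl after dilution: {residual_guhcl_m * 1000:.0f} mM")
```

The same function covers the later step in which pooled fractions are mixed with 4
volumes of water, which lowers every solute concentration five-fold.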

To clarify the refolded E. faecalis polypeptide solution, a previously prepared
tangential filtration unit equipped with 0.16 µm membrane filter with appropriate
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros
HS-50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same
buffer, in a stepwise manner. The absorbance at 280 nm of the effluent is
continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.

Fractions containing the E. faecalis polypeptide are then pooled and mixed
with 4 volumes of water. The diluted sample is then loaded onto a previously
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive
Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins.
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50
mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5.
Fractions are collected under constant A280 monitoring of the effluent. Fractions
containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE)
are then pooled.
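
For the linear gradient on the CM-20 column, the NaCl concentration at any point is a
straight-line interpolation between the start and end buffers over the 10 column
volumes. The following minimal sketch uses the stated endpoints (0.2 M and 1.0 M
NaCl); the function name is illustrative, and mixer or dead-volume delays in a real
chromatography system are not modeled.

```python
def gradient_nacl_m(cv_elapsed, start_m=0.2, end_m=1.0, total_cv=10.0):
    """NaCl molarity after cv_elapsed column volumes of a linear gradient."""
    if not 0.0 <= cv_elapsed <= total_cv:
        raise ValueError("cv_elapsed must lie within the gradient")
    return start_m + (end_m - start_m) * cv_elapsed / total_cv


for cv in (0, 2.5, 5, 7.5, 10):
    print(f"{cv:4} CV: {gradient_nacl_m(cv):.2f} M NaCl")
```

Recording the column volume at which a fraction elutes therefore gives an estimate of
the salt concentration at which the polypeptide leaves the resin.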

The resultant E. faecalis polypeptide exhibits greater than 95% purity after
the above refolding and purification steps. No major contaminant bands are observed
from Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is
loaded. The purified protein is also tested for endotoxin/LPS contamination, and
typically the LPS content is less than 0.1 ng/ml according to LAL assays.

Example 2(b): Alternative Expression and Purification of Enterococcal polypeptides in
E. coli

The vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) was
alternatively used to clone and express some of the polypeptides of the present
invention for use in the soft tissue and systemic infection models discussed below.
The difference is that the components of the pQE10 plasmid are arranged such that an
inserted DNA fragment encoding a polypeptide of the present invention expresses that
polypeptide with the six His residues (i.e., a "6 X His tag") covalently linked to the
amino terminus of that polypeptide.

The DNA sequences encoding the desired portions of a polypeptide of Table
1 were amplified using PCR oligonucleotide primers from genomic E. faecalis DNA.
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid
sequence of a polypeptide of the present invention. Additional nucleotides containing
restriction sites to facilitate cloning in the pQE10 vector were added to the 5' and 3'
primer sequences, respectively.

For cloning a polypeptide of the present invention, the 5' and 3' primers were
selected to amplify their respective nucleotide coding sequences. One of ordinary skill
in the art would appreciate that the point in the protein coding sequence where the 5'
and 3' primers begin may be varied to amplify a DNA segment encoding any desired
portion of a polypeptide of the present invention. The 5' primer was designed so the
coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain
its reading frame with that of the E. faecalis polypeptide. The 3' primer was designed
to include a stop codon. The amplified DNA fragment was then cloned, and the protein
expressed, as described above for the pQE60 plasmid.

The DNA sequences encoding the amino acid sequences of Table 1 may also
be cloned and expressed as fusion proteins by a protocol similar to that described
directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive,
Madison, WI 53711) is preferentially used in place of pQE10.

The above methods are not limited to the polypeptide fragments actually
produced. The above method, like the methods below, can be used to produce either
full length polypeptides or desired fragments thereof.

Example 2(c): Alternative Expression and Purification of Enterococcal polypeptides 
in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in
this example, the polypeptide coding sequence is inserted such that translation of the
six His codons is prevented and, therefore, the polypeptide is produced with no 6 X
His tag.

The DNA sequence encoding the desired portion of the E. faecalis amino acid
sequence is amplified from an E. faecalis genomic DNA prep or the deposited DNA
clones using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide
sequences corresponding to the desired portion of the E. faecalis polypeptides.
Additional nucleotides containing restriction sites to facilitate cloning in the pQE60
vector are added to the 5' and 3' primer sequences.

For cloning E. faecalis polypeptides of the present invention, 5' and 3'
primers are selected to amplify their respective nucleotide coding sequences. One of
ordinary skill in the art would appreciate that the point in the protein coding sequence
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding
any desired portion of a polypeptide of the present invention. The 5' and 3' primers
contain appropriate restriction sites followed by nucleotides complementary to the 5'
and 3' ends of the coding sequence respectively. The 3' primer is additionally designed
to include an in-frame stop codon.

The amplified E. faecalis DNA fragments and the vector pQE60 are digested
with restriction enzymes recognizing the sites in the primers and the digested DNAs
are then ligated together. Insertion of the E. faecalis DNA into the restricted pQE60
vector places the E. faecalis protein coding region including its associated stop codon
downstream from the IPTG-inducible promoter and in-frame with an initiating AUG.
The associated stop codon prevents translation of the six histidine codons
downstream of the insertion point.
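
The effect of the in-frame stop codon can be illustrated by translating two toy
constructs with the standard genetic code: one in which the insert runs straight into
six histidine codons, and one in which a stop codon sits between them. The sequences
below are short placeholders chosen for illustration, not sequences from Table 1 or
from the pQE60 vector.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from the first codon until a stop codon or the end of dna."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


insert = "ATGAAAGGTCTG"            # placeholder coding region (M K G L)
his_codons = "CATCACCATCACCATCAC"  # six histidine codons
vector_stop = "TAA"

tagged = insert + his_codons + vector_stop            # no stop in the insert
untagged = insert + "TAA" + his_codons + vector_stop  # insert carries its own stop

print("without insert stop:", translate(tagged))
print("with insert stop   :", translate(untagged))
```

In the second construct translation terminates at the insert's own stop codon, so the
downstream histidine codons are never read.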

The ligation mixture is transformed into competent E. coli cells using standard
procedures such as those described by Sambrook et al. E. coli strain M15/rep4,
containing multiple copies of the plasmid pREP4, which expresses the lac repressor
and confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative
example described herein. This strain, which is only one of many that are suitable for
expressing E. faecalis polypeptide, is available commercially (QIAGEN, Inc., supra).
Transformants are identified by their ability to grow on LB plates in the presence of
ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the
identity of the cloned DNA confirmed by restriction analysis, PCR and DNA
sequencing.

Clones containing the desired constructs are grown overnight ("O/N") in liquid
culture in LB media supplemented with both ampicillin (100 µg/ml) and kanamycin
(25 µg/ml). The O/N culture is used to inoculate a large culture, at a dilution of
approximately 1:25 to 1:250. The cells are grown to an optical density at 600 nm
("OD600") of between 0.4 and 0.6. Isopropyl-β-D-thiogalactopyranoside ("IPTG") is
then added to a final concentration of 1 mM to induce transcription from the lac
repressor sensitive promoter, by inactivating the lacI repressor. Cells subsequently
are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation.

To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at
4°C in 6 M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM
Na-acetate buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein
can be successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25
mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein
can be purified by ion exchange, hydrophobic interaction and size exclusion
chromatography. Alternatively, an affinity chromatography step such as an antibody
column can be used to obtain pure E. faecalis polypeptide. The purified protein is
stored at 4°C or frozen at -80°C.

The following alternative method may be used to purify E. faecalis
polypeptides expressed in E. coli when they are present in the form of inclusion bodies.
Unless otherwise specified, all of the following steps are conducted at 4-10°C.

Upon completion of the production phase of the E. coli fermentation, the cell
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per
unit weight of cell paste and the amount of purified protein required, an appropriate
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension
using a high shear mixer.

The cells are then lysed by passing the solution through a microfluidizer
(Microfluidics Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by
centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using 0.5 M
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4.

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine
hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 15 min., the
pellet is discarded and the E. faecalis polypeptide-containing supernatant is incubated
at 4°C overnight to allow further GuHCl extraction.

Following high speed centrifugation (30,000 x g) to remove insoluble particles,
the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C
without mixing for 12 hours prior to further purification steps.

To clarify the refolded E. faecalis polypeptide solution, a previously prepared
tangential filtration unit equipped with 0.16 µm membrane filter with appropriate
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros
HS-50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same
buffer, in a stepwise manner. The absorbance at 280 nm of the effluent is
continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.

Fractions containing the E. faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 
Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
mM sodium acetate, pH 6.0 to 1.0 M NaCl, 50 mM sodium acetate, pH 6.5. 
Fractions are collected under constant A280 monitoring of the effluent. Fractions 
containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 
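
For orientation, the NaCl concentration delivered at any point of the 10 column volume gradient can be approximated as below. This is a sketch under the assumption of an ideal linear gradient with no mixing delay or dead volume; the function name and defaults are illustrative, and the accompanying pH shift from 6.0 to 6.5 is not modeled.

```python
def gradient_nacl(cv_elapsed, start_molar=0.2, end_molar=1.0, gradient_cv=10.0):
    """NaCl concentration (M) delivered after cv_elapsed column volumes
    of an ideal linear gradient from start_molar to end_molar."""
    if not 0.0 <= cv_elapsed <= gradient_cv:
        raise ValueError("cv_elapsed must lie within the gradient")
    fraction = cv_elapsed / gradient_cv
    return start_molar + fraction * (end_molar - start_molar)


# Tabulate the nominal salt concentration every 2 column volumes.
for cv in range(0, 11, 2):
    print(f"{cv:>2} CV: {gradient_nacl(cv):.2f} M NaCl")
```

Such a table can help relate the fraction in which the polypeptide elutes to an approximate salt concentration.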

The resultant E. faecalis polypeptide exhibits greater than 95% purity after 
the above refolding and purification steps. No major contaminant bands are observed 
on a Coomassie blue stained 16% SDS-PAGE gel when 5 µg of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

Example 2(d): Cloning and Expression of E. faecalis in Other Bacteria 

E. faecalis polypeptides can also be produced in: E. faecalis using the methods 
of S. Skinner et al. (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein 
Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al. (1997) 
Appl. Microbiol. Biotechnol. 47(5):537-542; or Bacillus subtilis using the methods 
of Chang et al., U.S. Patent No. 4,952,508. 

Example 3: Cloning and Expression in COS Cells 

An E. faecalis expression plasmid is made by cloning a portion of the DNA 
encoding an E. faecalis polypeptide into the expression vector pDNAI/Amp or 
pDNAIII (which can be obtained from Invitrogen, Inc.). The expression vector 
pDNAI/Amp contains: (1) an E. coli origin of replication effective for propagation in 
E. coli and other prokaryotic cells; (2) an ampicillin resistance gene for selection of 
plasmid-containing prokaryotic cells; (3) an SV40 origin of replication for propagation 
in eukaryotic cells; (4) a CMV promoter, a polylinker, an SV40 intron; (5) several 
codons encoding a hemagglutinin fragment (i.e., an "HA" tag to facilitate purification) 
followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and 
operably linked to the SV40 intron and the polyadenylation signal by means of 
restriction sites in the polylinker. The HA tag corresponds to an epitope derived 
from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767. 
The fusion of the HA tag to the target protein allows easy detection and recovery of 
the recombinant protein with an antibody that recognizes the HA epitope. pDNAIII 
contains, in addition, the selectable neomycin marker. 

A DNA fragment encoding an E. faecalis polypeptide is cloned into the 
polylinker region of the vector so that recombinant protein expression is directed by 
the CMV promoter. The plasmid construction strategy is as follows. The DNA from 
an E. faecalis genomic DNA prep is amplified using primers that contain convenient 
restriction sites, much as described above for construction of vectors for expression of 
E. faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, 
and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3' primer 
contains nucleotides complementary to the 3' coding sequence of the E. faecalis DNA, 
a stop codon, and a convenient restriction site. 
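
The primer layout described above can be sketched in a few lines. This is an illustration under stated assumptions, not the primers actually used: the BamHI (GGATCC) and XbaI (TCTAGA) sites, the GCCACC Kozak context, the TAA stop codon, and the 18-nucleotide annealing length are all choices made for the example, and the input is a short in-frame fragment taken from the start of SEQ ID NO:3 that stands in for a full coding region lacking its own start codon.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(coding, site_5, site_3, anneal_len=18,
                   kozak="GCCACC", stop="TAA"):
    """Return (forward, reverse) primers, both written 5' to 3'.

    forward: restriction site + Kozak context + ATG + 5' coding nucleotides
             (the AUG start codon of the mRNA is ATG in the DNA primer)
    reverse: restriction site + reverse complement of (3' coding nucleotides + stop)
    """
    coding = coding.upper().replace(" ", "")
    if len(coding) < anneal_len:
        raise ValueError("coding region shorter than annealing length")
    forward = site_5 + kozak + "ATG" + coding[:anneal_len]
    reverse = site_3 + reverse_complement(coding[-anneal_len:] + stop)
    return forward, reverse


# Stand-in fragment (first 17 codons of SEQ ID NO:3, read in frame).
fragment = "TGTGGTAACGGTAATGGGGCCAAAGAATCAAACGATATTGTGAAAGAAGTG"
forward, reverse = design_primers(fragment, "GGATCC", "TCTAGA")
print("forward:", forward)
print("reverse:", reverse)
```

In practice a few extra nucleotides are often placed 5' of each restriction site to aid digestion; they are omitted here for brevity.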

The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested 
with appropriate restriction enzymes and then ligated. The ligation mixture is 
transformed into an appropriate E. coli strain such as SURE™ (Stratagene Cloning 
Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin 
media plates which then are incubated to allow growth of ampicillin resistant colonies. 
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 
or other means for the presence of the fragment encoding the E. faecalis polypeptide. 

For expression of a recombinant E. faecalis polypeptide, COS cells are 
transfected with an expression vector, as described above, using DEAE-dextran, as 
described, for instance, by Sambrook et al. (supra). Cells are incubated under 
conditions for expression of E. faecalis by the vector. 

Expression of the E. faecalis-HA fusion protein is detected by radiolabeling 
and immunoprecipitation, using methods described in, for example, Harlow et al., 
supra. To this end, two days after transfection, the cells are labeled by incubation in 
media containing 35S-cysteine for 8 hours. The cells and the media are collected, and 
the cells are washed and then lysed with detergent-containing RIPA buffer: 150 mM 
NaCl, 1% NP-40, 0.1% SDS, 0.5% DOC, 50 mM Tris, pH 7.5, as 
described by Wilson et al. (supra). Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
product of the expected size is seen in the cell lysate, which is not seen in negative 
controls. 

Example 4: Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E. faecalis polypeptide in this 
example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession 
No. 37146). The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate 
reductase activity that are transfected with these plasmids can be selected by growing the cells 
in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 
chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells 
resistant to methotrexate (MTX) has been well documented. See, e.g., Alt et al., 
1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 
1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target 
enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is 
linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known 
in the art that this approach may be used to develop cell lines carrying more than 
1,000 copies of the amplified gene(s). Subsequently, when the methotrexate is 
withdrawn, cell lines are obtained which contain the amplified gene integrated into one 
or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) 
of the Rous Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. 
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the 
immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 
41:521-530. Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: Bam HI, Xba I, and Asp 718. 
Behind these cloning sites the plasmid contains the 3' intron and polyadenylation site 
of the rat preproinsulin gene. Other high efficiency promoters can also be used for the 
expression, e.g., the human β-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet-Off 
and Tet-On gene expression systems and similar systems can be used to express 
the E. faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, 
Proc. Natl. Acad. Sci. USA 89:5547-5551). For the polyadenylation of the mRNA 
other signals, e.g., from the human growth hormone or globin genes can be used as 
well. Stable cell lines carrying a gene of interest integrated into the chromosomes can 
also be selected upon co-transfection with a selectable marker such as gpt, G418 or 
hygromycin. It is advantageous to use more than one selectable marker in the 
beginning, e.g., G418 plus methotrexate. 

The plasmid pC4 is digested with the restriction enzymes and then 
dephosphorylated using calf intestinal phosphatase by procedures known in the art. 
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to 
the 5' and 3' sequences of the desired portion of the gene. A 5' primer containing a 
restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' 
coding region of the E. faecalis polypeptide is synthesized and used. A 3' primer, 
containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the E. faecalis polypeptide, is synthesized and used. The 
amplified fragment is digested with the restriction endonucleases and then purified 
again on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are 
then ligated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
transfection. Five µg of the expression plasmid pC4 is cotransfected with 0.5 µg of 
the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin™ or 
LipofectAMINE™ (Life Technologies, Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 
that confers resistance to a group of antibiotics including G418. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 
petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 
100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 µM, 2 µM, 5 µM, 10 µM, 20 µM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 
100-200 µM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 5: Quantitative Murine Soft Tissue Infection Model for E. faecalis 

Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E. faecalis) using the following quantitative 
murine soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 
7 weeks old) are first treated with a biologically protective effective amount, or 
immune enhancing/stimulating effective amount of a composition of the present 
invention using methods known in the art, such as those discussed above. See, e.g., 
Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20 µg 
per animal. 

The desired bacterial species used to challenge the mice, such as E. faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 x 10^8 
cfu/ml in an appropriate medium, mixed well, serially diluted, and titered. The desired 
doses are further diluted 1:2 with sterilized Cytodex 3 microcarrier beads preswollen 
in sterile PBS (3 g/100 ml). Mice are anesthetized briefly until docile but still mobile, 
and 0.2 ml of the Cytodex 3 bead/bacterial mixture is injected into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection 
as day one, mice are sacrificed and the contents of the abscess are excised and placed in 
a 15 ml conical tube containing 1.0 ml of sterile PBS. The contents of the abscess are 
then enzymatically treated and plated as follows. 
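
The challenge dose actually delivered to each animal follows from the dilution steps above. The sketch below assumes that "1:2" means a two-fold dilution (one part culture to one part bead suspension) and that serial dilutions are ten-fold; both are interpretations rather than statements of the method, and the function names are hypothetical.

```python
def serial_dilutions(start_cfu_per_ml, steps, factor=10.0):
    """Concentrations (cfu/ml) of the starting culture and each serial dilution."""
    return [start_cfu_per_ml / factor ** i for i in range(steps + 1)]


def cfu_per_injection(dose_cfu_per_ml, bead_fold=2.0, injection_ml=0.2):
    """cfu delivered per animal after the bead dilution and a 0.2 ml injection."""
    return dose_cfu_per_ml / bead_fold * injection_ml


# Starting culture at 5 x 10^8 cfu/ml with three assumed ten-fold dilutions.
for dose in serial_dilutions(5e8, 3):
    print(f"{dose:.1e} cfu/ml -> {cfu_per_injection(dose):.1e} cfu per animal")
```

The nominal values should be confirmed against the titers obtained by plating, as the text indicates.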

The abscess is first disrupted by vortexing with sterilized glass beads placed in 
the tubes. 3.0 ml of prepared enzyme mixture (1.0 ml Collagenase D (4.0 mg/ml), 
1.0 ml Trypsin (6.0 mg/ml) and 8.0 ml PBS) is then added to each tube followed by a 
20 min. incubation at 37°C. The solution is then centrifuged and the supernatant 
drawn off. 0.5 ml dH2O is then added and the tubes are vortexed and then incubated 
for 10 min. at room temperature. 0.5 ml medium is then added and samples are serially 
diluted and plated onto agar plates, and grown overnight at 37°C. Plates with distinct 
and separate colonies are then counted, compared to positive and negative control 
samples, and quantified. The method can be used to identify compositions and 
determine appropriate and effective doses for humans and other animals by comparing 
the effective doses of compositions of the present invention with compositions 
known in the art to be effective in both mice and humans. Doses for the effective 
treatment of humans and other animals, using compositions of the present invention, 
are extrapolated using the data from the above experiments in mice. It is appreciated 
that further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 
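
The quantification step can be illustrated with a short calculation. This is a sketch only: the plated volume (0.1 ml) is an assumption not stated in the example, the resuspension volume is taken as the 0.5 ml dH2O plus 0.5 ml medium added after centrifugation, and the colony counts are invented for illustration.

```python
import math


def cfu_per_abscess(colonies, dilution_factor, plated_ml=0.1, resuspension_ml=1.0):
    """Estimate total cfu recovered from one abscess.

    colonies         colonies counted on a plate with well separated colonies
    dilution_factor  total fold dilution of the plated sample (e.g. 1e4)
    plated_ml        volume spread on the plate (assumed value)
    resuspension_ml  volume in which the processed pellet was resuspended
    """
    cfu_per_ml = colonies * dilution_factor / plated_ml
    return cfu_per_ml * resuspension_ml


def log10_reduction(control_cfu, treated_cfu):
    """Log10 reduction in recovered cfu relative to an untreated control."""
    return math.log10(control_cfu / treated_cfu)


# Invented counts: control plate vs. plate from an immunized animal.
control = cfu_per_abscess(42, 1e4)
treated = cfu_per_abscess(42, 1e2)
print(f"control {control:.1e} cfu, treated {treated:.1e} cfu, "
      f"reduction {log10_reduction(control, treated):.1f} log10")
```

Expressing the result as a log10 reduction is one common way to compare treated and control groups; other summaries may be equally appropriate.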

Example 6: Murine Systemic Neutropenic Model for E. faecalis Infection 

Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E. faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 
weeks old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention 
using methods known in the art, such as those discussed above. See, e.g., Harlow et 
al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). An example of an appropriate starting dose is 20 µg per animal. 
Mice are then injected with 250-300 mg/kg cyclophosphamide intraperitoneally. 
Counting the day of C.P. injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNLs. 

The desired bacterial species used to challenge the mice, such as E. faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 x 10^8 
cfu/ml in an appropriate medium, mixed well, serially diluted, and titered. The desired 
doses are further diluted 1:2 in 4% Brewer's yeast in medium. 
Mice are injected with the bacteria/Brewer's yeast challenge intraperitoneally. The 
Brewer's yeast solution alone is used as a control. The mice are then monitored twice 
daily for the first week following challenge, and once a day for the next week to 
ascertain morbidity and mortality. Mice remaining at the end of the experiment are 
sacrificed. The method can be used to identify compositions and determine 
appropriate and effective doses for humans and other animals by comparing the 
effective doses of compositions of the present invention with compositions known in 
the art to be effective in both mice and humans. Doses for the effective treatment of 
humans and other animals, using compositions of the present invention, are 
extrapolated using the data from the above experiments in mice. It is appreciated that 
further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 

The disclosures of all publications (including patents, patent applications, 
journal articles, laboratory manuals, books, or other documents) cited herein are 
hereby incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific 
embodiments described herein, which are intended as single illustrations of individual 
aspects of the invention. Functionally equivalent methods and components are within 
the scope of the invention, in addition to those shown and described herein, and will 
become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of 
the appended claims. 

TABLE 1. Nucleotide and Amino Acid Sequences of E. faecalis Genes. 



EF001-1 (SEQ ID NO:1) 

TGAAAGAATA TTGCCAGAAC GTGGCGAGCA AATTGTTTTA TAAATTTTTT TAAGGGAGAG 
AAAAAAATGA AGTTCAAAAC TCTAGCAACA ACAGTGTTAG CAACCGCAGC TATTTTCGCA 
TTGGGGGCTT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAATAA 



EF001-2 (SEQ ID NO:2) 

MKFKTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK 
EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EVVGAGFDSL NNYYAIGMKN 
KGVDFNKDLD LTSKDSQEVV DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD IYMFDSATPE QRTAAFEFMK FLATPDSQLY 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF AIPVEENADS AYNEMRTIME 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EF001-3 (SEQ ID NO:3) 

TT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT GGTGGCTATG AATACGGTGT TGCACCACGT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA GATATTTATA TGTTCGATAG TGCTACGCCA 
GAACAACGGA CAGCGGCATT TGAATTCATG AAATTCTTAG CTACTCCTGA TTCACAATTG 
TACTGGGCAC AACAAACAGG TTATATGCCA ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
TACAAAAATT CTAAGACAAC CAAAGTACCT GCACAACTTG AAAACGCAGT AAAAGATTTA 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT TCAGCCTATA ATGAAATGCG GACAATTATG 
GAAAGTATTT TTGCTTCATC AAATAAAGAC ACGAGAAAAT TATTGAAAGA TGCAACATCA 
CAATTTGAAC AAGCATGGAA CCAA 



EF001-4 (SEQ ID NO:4) 

CGNGNGAK ESNDIVKEVK 
EDTTITFWHA MNGVQEEALT KLTKDFMKEN PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA SKTIYEKSNK EVVGAGFDSL NNYYAIGMKN 
KGVDFNKDLD LTSKDSQEVV DYYRDGIEAG YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD IYMFDSATPE QRTAAFEFMK FLATPDSQLY 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA QLENAVKDLF AIPVEENADS AYNEMRTIME 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



EF002-1 (SEQ ID NO: 5) 

TAAATAGCGG AGGTAGTACA AATGAAATTT TGGAAAAAAG GCTTAACAGC GGCAGCGCTG 
TTAGCAGTGG CGGCAGTAAC TTTAACAGCA TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 
GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
ACCCAGATAC AATTAATTTA G 



EF002-2 (SEQ ID NO:6) 

MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 




KIYFARKGMP SDESHKKPLT QIQLI 

EF002-3 (SEQ ID NO:7) 

A TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 

GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA CTTGGAATTA TGACACGACC 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT CACTATTGAA 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT TTCATCAGGA 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC TGCCAAAGCA 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG TACAGATTTC 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA TCCCGATAAC 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA AGAACAAGTA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA CTTATTATGA TCGCGCATTG 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC AAAAGTAACG 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG CTGGTACATG 
GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG GATTGCCGAA 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG TTTTGCAATT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC AGATGAAATT 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA AAAGCCTTTA 
ACCCAGATAC AATTAATT 

EF002-4 (SEQ ID NO: 8) 

C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 

EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
KIYFARKGMP SDESHKKPLT QIQLI 



EF003-1 (SEQ ID NO:9) 

TAGGAGGACA AAAGAATGAA GAAGTTTTAT TTAGCNACAT TCGCTGTTAT TGCAACAGTT 
ATTTTAGCTG CCTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT TTAA 



EF003-2 (SEQ ID NO: 10) 

MKKFYL ATFAVIATVI LAACGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 
KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADVVENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQVVTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 

EF003-3 (SEQ ID NO:11) 

CTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GTGCAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT T 



EF003-4 (SEQ ID NO: 12) 

CGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEK 

KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADVVENPKN 
ITFESIDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQVVTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF 

EF004-1 (SEQ ID NO:13) 

TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG CATTGCTCTT 
ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AATAA 

EF004-2 (SEQ ID NO: 14) 

MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 
ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 

EF004-3 (SEQ ID NO: 15) 

CTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAACTC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AA 



EF004-4 (SEQ ID NO: 16) 

CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF005-1 (SEQ ID NO: 17) 

TAAAAAATGA AAAAACGATT GACGATTGTG GGGATGCTTT TTCTGGCCAT TTTAGTAATG 
GTTGGTTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AATAA 

EF005-2 (SEQ ID NO: 18) 

MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 
WSDAVLTPEG EKVVTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEVVRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLVVSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 

EF005-3 (SEQ ID NO:19) 

TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCGAGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGCCACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AA 

EF005-4 (SEQ ID NO:20) 

CGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 

WSDAVLTPEG EKVVTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEVVRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLVVSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 



EF006-1 (SEQ ID NO:21) 

TAAACGATAA ATGGAGGGAA TAAGATGAAA AAACGTACAT TATGGTCAGT AATTACTGTA 
GCAGTAGCTG TCTTAGTTTT AGGGGCTTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 
AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-2 (SEQ ID NO:22) 

MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF006-3 (SEQ ID NO:23) 

TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-4 (SEQ ID NO:24) 

CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 

KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF008-1 (SEQ ID NO:25) 

TAAACCGTGA GAAAGAAATG GAGGAATCAA CGAATGAAAA AATTTAGTTT ATTTTTTTTA 
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA 



EF008-2 (SEQ ID NO:26) 

MKKFSLFFLT LLAGLTLAAC GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 

EF008-3 (SEQ ID NO:27) 

T TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 

AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAGTGAAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAA

EF008-4 (SEQ ID NO:28) 

C GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY
YSMMNWNLTK IHDGLMSK



EF009-1 (SEQ ID NO:29) 

TGACAAATGA AAAAATTTAG TAAATTAATT GGACTTATTG GGGTATTAGC TTTTACGATT
GCAGGTTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAATAA 

EF009-2 (SEQ ID NO:30)

MKKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGVV GTKNDEWESV KDRLKKKNID
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IVVAREEDQE
NKLYQKVVEE YQQEETKKVI AETSKGANVP AWETFGKK

EF009-3 (SEQ ID NO:31) 

TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA 



EF009-4 (SEQ ID NO:32) 

CASGSVKDT KTETVKLGVV GTKNDEWESV KDRLKKKNID
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IVVAREEDQE
NKLYQKVVEE YQQEETKKVI AETSKGANVP AWETFGKK



EF010-1 (SEQ ID NO:33) 

TGAAAGAATA AAATTGTACA GGAGGAAATA AGGAATGAAA AAATGGCAAA AAGGATTAGC 
CGTAGCTGGC GCACAGCTTT AGCTGTAGGA CTAAGCGCGT GCGGTAAATC TTCAAAAGAT
GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA TGTATCGTGT TGGGGACAAA 
CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA AAAAATTGGG
GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT GTCAACAATC
GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC GAATGCACAA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA AGCCTATGAT
CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA TGCGTTCCCA
ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TGTCGATAAA 
TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT 
ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA 
GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG ATGCTGCTAC AAGTACAACA 
CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC TTGTTTCTCG TCCACTAACA 
GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT 
AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATCGTGTG 
AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC 
TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG TGATAAGAGC 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATACAGGA 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC TCAAAAATAA 

EF010-2 (SEQ ID NO:34) 

MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL AQNYATNAQK 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 
NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 
GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYVVANTSK NKEKSVELLG LLNSNPELLN
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKSI
EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD 
KVQKEMQTQL DEYIQSQK 



EF010-3 (SEQ ID NO:35)

GT GCGGTAAATC TTCAAAAGAT
GCAGCGTCAA AAGGTGATGA TAGTACACCA ACGTTATTAA TGTATCGTGT TGGGGACAAA
CCAGATAATT ATGACCAATT AATCGATAAT GCGAATAAAA TTATCGAGAA AAAAATTGGG
GCAAAATTAA AAATGGAATT TGTTGGTTGG GGCGATTGGG ACCAAAAAAT GTCAACAATC
GTTGCTTCTG GTGAAAGCTA TGATATTTCA TTAGCACAAA ATTATGCAAC GAATGCACAA
AAAGGCGCCT ATGCTGATTT AACTGATTTA GCACCTAAAT ATGCCAAAGA AGCCTATGAT
CAATTGCCAG ATAACTATAT TAAAGGAAAT ACGATTAATG GAAAACTGTA TGCGTTCCCA
ATTTTAGGTA ACTCTTACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TGTCGATAAA
TACAATTTAG ATATTAGTAA AGTCGATGGT AGTTATGAAA GTGCAACGGA AGTTCTAAAA
GAATTCCNTA AAAANGANCC AAATATTGCT GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA
ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT
ACTGGCTCAC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA
GTCTTGCATC AATGGTATAA AGATGGCTTG ATTCCAACAG ATGCTGCTAC AAGTACAACA
CCATATGACT TAAATACCAA TACTTGGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT
TATGGTGATA CAATCTTAAC ACAAGCTGCT GGCAAACCAC TTGTTTCTCG TCCACTAACA
GAACCATTAA AAACAACAGC TCAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT
AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATTGTTA
AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATCGTGTG
AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC
TTAATCATTT GGCCAGAAGA ATCTGTCACT GAAGAAATGG TTAAAGAACG TGATAAGAGC
ATCGAAGAAG CAAAAGATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA
ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATACAGGA
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAATGGATG ACCTAAAAAC AGCTGGCTGG
GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAAT ATATCCAATC TCAAAAA



EF010-4 (SEQ ID NO:36)

CGKSSKDA ASKGDDSTPT LLMYRVGDKP
DNYDQLIDNA NKIIEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL AQNYATNAQK
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY
NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT
GSPKIINQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYVVANTSK NKEKSVELLG LLNSNPELLN
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKSI
EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD
KVQKEMQTQL DEYIQSQK



EF011-1 (SEQ ID NO:37) 

TAACGTTTTT GGAGGAAAAG AATGAAAAAG AAATTTTTAG CAATGATGGC AGTTTCAATG
ATGGGACTGT TAATGTTAAG TGCTTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA
ACAGAAACAA CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA
AAATACCAAA AAGTTGAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT
CAACTAAAAC CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA
AAAGCCATTG CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG
AAACAAAATA TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA
ATAACTGGCT TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT
AATGCGCTTG TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT
TTCGGTTTAA TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC
ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA
AAAGGATTAG AGTAA

EF011-2 (SEQ ID NO:38) 

MKKK FLAMMAVSMM GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP
KNPKKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI
TGLEKEIADV KKQAEASANN ALVVLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV
WYLSGGGLES MHLMIEDVKK GLE



EF011-3 (SEQ ID NO:39) 



TTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA
ACAGAAACAA CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA
AAATACCAAA AAGTTGAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT
CAACTAAAAC CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA
AAAGCCATTG CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG
AAACAAAATA TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA
ATAACTGGCT TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT
AATGCGCTTG TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT
TTCGGTTTAA TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC
ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA
AAAGGATTAG AG



EF011-4 (SEQ ID NO:40) 

CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP
KNPKKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI
TGLEKEIADV KKQAEASANN ALVVLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF012-1 (SEQ ID NO:41) 

TGAGGGGGCA ACAACATGAA ATTGGGGAAA AAAGTAGTAG GTTTGATTGC AACAGGGTTT
CTTTTAGCCG CATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT
GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAATGA 

EF012-2 (SEQ ID NO:42) 

MKLGKK VVGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA
QPSFLAVVSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF012-3 (SEQ ID NO:43)

ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT
GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC
ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG
AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA
GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG
GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA
TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG
CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA
GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA
GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA
CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC
TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA 



EF012-4 (SEQ ID NO:44) 

CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA
QPSFLAVVSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF013-1 (SEQ ID NO:45) 

TAACGAAAAA TGAAAAAAAT TGCTTTGTTC AGTATGTTAA CGTTCAGTGT ATTGTCTTTA 
AGTCTAGCAG GATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA ATTAA 

EF013-2 (SEQ ID NO:46) 

MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP
NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN

EF013-3 (SEQ ID NO:47)

ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT
GATTTACCAA AT

EF013-4 (SEQ ID NO:48) 

CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP
NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG
TKNRLDVTLN GEIGTPYLFD LPN

EF014-1 (SEQ ID NO:49) 

TGATGGTGGA GACTTTTTAA GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC 
AGCTTAATTA GTTTAGTCAT CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT 
GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA
ACGGAAAATG GCTTTATAAA ATAA 

EF014-2 (SEQ ID NO:50) 

MSRKRKIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY 
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYVVFNF LSFIKLIDAV GGIDVNVKQA
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING
NTYIGVPGNT QTGPLPSVKT ENGFIK

EF014-3 (SEQ ID NO:51) 

TGCT 

GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA A 



EF014-4 (SEQ ID NO:52)

AV AGSYLKKTID KGYVPIKNDY
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYVVFNF LSFIKLIDAV GGIDVNVKQA
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF015-1 (SEQ ID NO:53) 

TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT TATTATCTTT 
TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC ATTTAACTCC 
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA CTAA 

EF015-2 (SEQ ID NO:54)

MK KGDIIVIIFL IAISFSPYFI FFHNNPFNSK SFDDTKYAVV KIDGKEIERI
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 



EF015-3 (SEQ ID NO:55)

CAATAATCC ATTTAACTCC
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATCCATCAAA AGGGCAATAT
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA C 

EF015-4 (SEQ ID NO:56) 

NNPFNSK SFDDTKYAVV KIDGKEIERI
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR
FILEIVQQYS KDYYIY 



EF016-1 (SEQ ID NO:57) 

TGACGGTTGC CCCCGTCCAA TAGAAAGGAG TTTATGATGA AAAAGAAATA TTCTTTAGCC 
TTGCTGGTTA TCTGTTGTAG TTTACTCCTA TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC 
GAAGATCAAT GGACACGGAT TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATGAAAGAAA CAGAATTACA AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
AAATTTGTTA AAGACCAAAC ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT 
TTAAAATCTG GTCGAATTGA CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT 
TCCCACGAAG ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC 
GTTACAAATA ACACAAAAAT AAACTAA 



EF016-2 (SEQ ID NO:58) 

MMKKKYSLAL LVICCSLLLF AGCGKRKSNE DQWTRINEEK RIIIGLDDSF 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF016-3 (SEQ ID NO:59) 
AAGCAAC 

GAAGATCAAT GGACACGGAT TAACGAAGAA AAACGGATTA TTATTGGCTT AGATGACTCC 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATGAAAGAAA CAGAATTACA AAATCAAACC ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC ACACAACCTT ACATGACGAA CGACCAAGTA 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
AAATTTGTTA AAGACCAAAC ACCTATTTTA TATGACGGCT TTAATGAAGC TTTCTTAGAT 
TTAAAATCTG GTCGAATTGA CGGACTCCTA ATCGATCGCG TTTACGCCAA CTACTATCTT
TCCCACGAAG ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT AAAATTTCTC AAAAATGGTT TGGAGAGGAC 
GTTACAAATA ACACAAAAAT AAAC 

EF016-4 (SEQ ID NO:60) 

SNE DQWTRINEEK RIIIGLDDSF 

VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF017-1 (SEQ ID NO:61) 

TGAGGTGTTT TTATGAAAAG GGCAAGAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT 
CTACTTCTCT CGGGCTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 
TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAGTA A 



EF017-2 (SEQ ID NO: 62) 

MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 
AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QVVKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS
PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 

EF017-3 (SEQ ID NO: 63) 

CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 

TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AGCAAATGAT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAG 



EF017-4 (SEQ ID NO: 64) 

CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 

AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QVVKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
LNPSEPEKAK ELWQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 
PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA
NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 



EF018-1 (SEQ ID NO: 65) 

TGTCATTACA ACGATACCAA TTTTAATCAT TTATCCATTA CTACAAAAAC ACTTTATCGG
CGGTATGATG GCCGGTGCAG TAAAAGAATA AAGAAAGTAG GGAACAATAT GAAAAAAGTT
TTAGGCGGTT TATTGGTGGC AACGGCGGTC GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA
AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAGTAA 

EF018-2 (SEQ ID NO:66) 

MKKV LGGLLVATAV VSLAACSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
MQEWGKGKLV PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 

EF018-3 (SEQ ID NO: 67) 

CTGTAG CGGTGGGGAA 

AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC
ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT
CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 
TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 
GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 
GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 
TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 
GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 
ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 
GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 
AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 
TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 
ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 
GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 
GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 
AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAG 

EF018-4 (SEQ ID NO: 68) 

CSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKWYINSD WTALPFGKDV TTAQIKKDLN
VDIEFISGDD SKLNAMISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNIPVNDNF VIREDVYNAL GKPDVSTPEN
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE
WLKTFNDVYR AGNISDDSFT DDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR
YIAIDGPSST SGRKPTLNQT GISGWLSNYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ
MQEWGKGKLV PHFVIENINP DAGTPEARAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY
KAFLKSNKWD AIEKIKSEKM AENRDKLK



EF019-1 (SEQ ID NO:69) 

TAAAGGAGTT ACACAATGAA ACTTTTAAAA AAGACGGTCC TAATTGGTAC AACCCTTCTT 
CTTGGTTCAT TCTTACTCGC AGCTTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 
AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA TTAA 

EF019-2 (SEQ ID NO:70)

MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL
ISSSALVEGG KYKEYSKIAP TYVVKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
IANTQIVEDV KKALLN 

EF019-3 (SEQ ID NO:71)

TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC
AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG
CCAACTTATG TAGTCAAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT
GCCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA GTAGTTGGTT ATACAACGGA
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA T

EF019-4 (SEQ ID NO:72)

CGN TNKEANNADK THEVTDTLGN KVTVPAKPKR
IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL
ISSSALVEGG KYKEYSKIAP TYVVKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP
IANTQIVEDV KKALLN



EF020-1 (SEQ ID NO:73) 



TGAGGAGATG AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC
ACATTAACTG CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA
GGGAAATTTA AATAA



EF020-2 (SEQ ID NO: 74) 



MKKVVS ILLMVVAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEVVYKSG
KFK 



EF020-3 (SEQ ID NO:75) 

ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AA 

EF020-4 (SEQ ID NO:76) 

CNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 

KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEVVYKSG
KFK 



EF021-1 (SEQ ID NO:77) 

TAGTTGTTTA AATACATTAA ACTATTTTTA GGAGGCTTTA CAGAAATGAA AAAAGCAAAA
TTATTCGGTT TTAGTTTGAT TGCATTAGGT TTATCAGTTT CACTTGCAGC ATGTGGTGGT
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA
ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA
TTGCAAGCTT GGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGGTA TGCATATATT
CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTATC AAGTAAATTC
AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC
AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT
TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT
GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT
GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC
GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC
CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT
GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAATA A



EF021-2 (SEQ ID NO:78)

MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI
ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN
TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNVVS ATFRDNEAAY LAGVAAANET
KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA
MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK
DKVISGDVKV PEKPE



EF021-3 (SEQ ID NO:79)

ATGTGGTGGT
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA
ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA
TTGCAAGCTT GGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGGTA TGCATATATT
CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTATC AAGTAAATTC
AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC
AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT 
TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 
ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 
CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT 
GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 
GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 
GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 
GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 
AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 
CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 
GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 
AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAA 



EF021-4 (SEQ ID NO:80) 

CGGG KGKTAESGGG KGDAAHSAVI
ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN
TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNVVS ATFRDNEAAY LAGVAAANET
KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA
MYQNGVDIIF HASGATGQGV FQEAKDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK
DKVISGDVKV PEKPE



EF022-1 (SEQ ID NO:81) 

TAAGAGCATA AAAAAATGAA GAGTTATAGG AGAAAGAAGA TGAAAAAGTA TTTAAAAATC 
ACAATGGTTT GTATTTTATT GGTAGGATTT TTAGCTGGGT GTACCAATAA AAATGAAAAT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC GTTCAACTGA TGTCACCCTC GGAATTAACA 
ACGCTCAACA CCTCTGTATT ATTGGATTTT CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
CCGATGATTT CAGAAGATGG AAAAACCTAC ACGATTTCTT TGAGAAAAGA AGCGGTTTGG
AGTAACGATG ATCCTGTCAC AGCACATGAT TTTGAATATG CTTGGAAAAA AATGATTGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG TACTTTACGT CCTTGTTAGC TTTTCCGACA 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA CAATTTGGTG CGGACTATGG AACTGCTAGT
GATAAAGTCG TCTATAATGG TCCGTTCGTG GTAAAAGATT GGCAGCAAAC AAAGATGGAC
TGGCAACTAG CAAAAAATAA TCGCTATTGG GATCACCAGA ACGTGCGCTC AGACATTATC 
AATTATACAG TTATCAAAGA AACATCTACC GCATTGAATC TTTTTGAAGA TGGACAATTA 
GATGTGGCTA CACTAAGTGG TGAACTGGCG CAACAGAATA AAAATAATAC GTTGTATCAT 
TCGTATCCAA CAGCGACAAT GAACTATTTG CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
CTAGTCAATA ATATTATTGC AGATGGTTCT AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GTTAACGTTG AATTGATGGT AACAGATGAT GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT TTGACAATAG AGCTAACCGC ATTGCCGACT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT GACTATGATT TATTCTTAAT TTACTGGACA 
CCAGACTATC AAGACCCTAT TTCTACCCTG ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
TATCAGAACC CTGTCTATGA CAAATTATTA GATGAAGCAG CCACAACCTA TGCCTTAGAG 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA GCTGAAAAAG AAGTGATTGA AACGACTGCT
GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA 



EF022-2 (SEQ ID NO:82)

MKKYLKIT MVCILLVGFL AGCTNKNENK KKQKNTKEAV QLMSPSELTT 
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
KVTLKEPKPY FTSLLAFPTF FPQNXKVVEQ FGADYGTASD KVVYNGPFVV KDWQQTKMDW
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 



EF022-3 (SEQ ID NO:83)
GT GTACCAATAA AAATGAAAAT 

AAAAAGAAAC AGAAAAATAC CAAAGAAGCC GTTCAACTGA TGTCACCCTC GGAATTAACA 
ACGCTCAACA CCTCTGTATT ATTGGATTTT CCAGATGCTA TTGTCCAAAC TGCAGCGTTT
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
CCGATGATTT CAGAAGATGG AAAAACCTAC ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
AGTAACGATG ATCCTGTCAC AGCACATGAT TTTGAATATG CTTGGAAAAA AATGATTGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG TACTTTACGT CCTTGTTAGC TTTTCCGACA
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA CAATTTGGTG CGGACTATGG AACTGCTAGT 
GATAAAGTCG TCTATAATGG TCCGTTCGTG GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
TGGCAACTAG CAAAAAATAA TCGCTATTGG GATCACCAGA ACGTGCGCTC AGACATTATC 
AATTATACAG TTATCAAAGA AACATCTACC GCATTGAATC TTTTTGAAGA TGGACAATTA 
GATGTGGCTA CACTAAGTGG TGAACTGGCG CAACAGAATA AAAATAATAC GTTGTATCAT 
TCGTATCCAA CAGCGACAAT GAACTATTTG CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
CTAGTCAATA ATATTATTGC AGATGGTTCT AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT TTTCGTCAAG AAGCAGGTAA TTTAATGGTT
TATAACAAAG AAAAAGCGCA AAGTTATTGG AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GTTAACGTTG AATTGATGGT AACAGATGAT GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT TTGACAATAG AGCTAACCGC ATTGCCGACT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT GACTATGATT TATTCTTAAT TTACTGGACA
CCAGACTATC AAGACCCTAT TTCTACCCTG ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
TATCAGAACC CTGTCTATGA CAAATTATTA GATGAAGCAG CCACAACCTA TGCCTTAGAG 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA GCTGAAAAAG AAGTGATTGA AACGACTGCT 
GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAA 



EF022-4 (SEQ ID NO:84) 

CTNKNENK KKQKNTKEAV QLMSPSELTT
LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL
KVTLKEPKPY FTSLLAFPTF FPQNXKVVEQ FGADYGTASD KVVYNGPFVV KDWQQTKMDW
QLAKNNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 

EF023-1 (SEQ ID NO:85) 

TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTGCTTTTA 
GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA
CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAACTAA 



EF023-2 (SEQ ID NO:86) 

MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDVVNKLSG EFIPGYVDNP AFLSIPQFVT
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN

EF023-3 (SEQ ID NO:87)

GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA

CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC 



EF023-4 (SEQ ID NO: 88)

GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 

TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDVVNKLSG EFIPGYVDNP AFLSIPQFVT
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 

EF024-1 (SEQ ID NO:89) 

TAATGGCCGT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAATAA

EF024-2 (SEQ ID NO: 90)

M KKVLPFIALV GLLLLSGCGT DMKKILTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF024-3 (SEQ ID NO:91) 
ATT GACTGCCGAT 

GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAA 



EF024-4 (SEQ ID NO: 92) 
LTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 

EF025-1 (SEQ ID NO: 93) 

TGAATGAAAC ATATTAAAGG AATGTTGGTT TTTATCGGAT TATTTATTTT GGTTGGTTGT 
GCGCCAGATC AAGAGCCAAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 
AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTGTA G 

EF025-2 (SEQ ID NO: 94) 

MKH IKGMLVF IGLFILVGCA PDQEPTKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN
DELIANQLTF DSHEYTYEVV TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY
YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS
NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII
HQDTINKPTI LLFIL



EF025-3 (SEQ ID NO: 95)

AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG

AAGCAAGTTA CCGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
AATGACGAAC TGATTGCTAA TCAATTGACT TTTGATTCTC ATGAATACAC GTACGAAGTG 
GTTACAGGGG CCACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT CAACCGCCTT TGGGATTAAT GACGGGTAAC 
TATTATAAAA ATGAAGGTGT ATTTACTGGC GGAAATTACG GCATTGTAGA GATTATTACG
GAACCTGAAA CGCAAAGGAT TCTGAATGTT GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
AGCAACACGC GTACAGACGA TACGTTAGTC ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAACAAATGC GTGACGAAAA TCGTGTTACA GGTAATTTTT ATACGGTACG CGGTTCATCA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCATCGAAAG AAACGTATAT CGGTTACGCA GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA AAACATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAACCAACA ATTCTTTTAT TCATTTTG 

EF025-4 (SEQ ID NO:96) 

TKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 

DELIANQLTF DSHEYTYEVV TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY
YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS
NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPLA AEMDTWLKEP
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII 
HQDTINKPTI LLFIL 



EF026-1 (SEQ ID NO: 97) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAATTAA

EF026-2 (SEQ ID NO: 98) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF026-3 (SEQ ID NO: 99) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAAT

EF026-4 (SEQ ID NO:100)

TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF027-1 (SEQ ID NO:101) 

TTTGGTATGA AACAGAAAAA GTGGTTAATC GGACTTGTTG CACTGGGCTT GGTTTTAGCA 
GCATGTGGAA GTGGCGGTTC GAAAACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 
GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
TTTAAAAATG GGCGTGCGGT GCGGGAAGGA CAAGCCACGA TGGAAGAATT TGGTGTCAAA 
GCAATCGATG ACCAGACACT AGAACTAACA TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
GATGGCAATT CCGAAACTTG GAAATTGAAG AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGCACAGG AGCCAATCTT 
TTTGATAATG GCGACTTAGA TTACACTGTT TTAGCAGATA CTTATGCACT TCAGTATAAA 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGCCGTGAAA TTACCGGCAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT GGCTCGACAG CTTTAAATGG NTTTGTACCA 
GCTAATTTTG CAAAAATCCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
TGCCATATAA TATTAAAGAA GCCCAAGCTA ACTGGAACAA TT 

EF027-2 (SEQ ID NO: 102) 

MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS 
SDMIGQVVEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFVV
AFRNVVDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL TGTI 

EF027-3 (SEQ ID NO:103) 

AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
TTTAAAAATG GGCGTGCGGT GCGGGAAGGA CAAGCCACGA TGGAAGAATT TGGTGTCAAA
GCAATCGATG ACCAGACACT AGAACTAACA TTGGAAAATC CAATTCCTTA TTTAGCCCAA 
GTCTTGGTTG GGACACCTTT TATGCCTAAA AATGAAGCCT TTGCCAAAGA AAAAGGTACT 
GCCTATGGGA CTTCTGCAGA TAATTTTGTT GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
GATGGCAATT CCGAAACTTG GAAATTGAAG AAGAATGATC ATTATTGGGA TAAAGAACAC 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGCACAGG AGCCAATCTT
TTTGATAATG GCGACTTAGA TTACACTGTT TTAGCAGATA CTTATGCACT TCAGTATAAA 
GAGTCAAAAC AAGCGCATTT TGTACCTAAA GCCATGGTGG GTTATTTAAG CCCCAATCAT 
CGCCGTGAAA TTACCGGCAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GAAACTTTTG CAAAAGAAAT TTTAGGAGAT GGCTCGACAG CTTTAAATGG NTTTGTACCA
GCTAATTTTG CAAAAATCCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
TGCCATATAA TATTAAAGAA GCCCAAGCTA A 

EF027-4 (SEQ ID NO:104) 

TT SNEPATQKIN VASGGELSTL DSAHYTDVYS 

SDMIGQVVEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFVV
AFRNVVDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHIILKKPKL 



EF028-1 (SEQ ID NO: 105)

TAACAGAAGC AATACAACAA CTTAACACTT TGTTTACTTG TTATTTATCA GAAATCAACT
AAGACTTGTT ATAGTCAATG TATGGGTAGA TATGAAGGAG GAAACAAGGA AATGAAGAAA
AGAGCTTTGC TAGGGGTTAC CTTATTAACA TTCACAACAT TAGCGGGTTG TACAAATTTA
TCTGAACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC
CGGAAAGATC GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTTTAG

EF028-2 (SEQ ID NO:106)

MKKR ALLGVTLLTF TTLAGCTNLS
EQKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ
LQIIQQGACL

EF028-3 (SEQ ID NO:107) 

ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT ATGATTGGAG ATGGCATGGG GAATCCGTAT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT CACTCAGACA AGCGTGTTCC CCAAACAGCT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC ACTTATCCAG AAGATGAAGA AGAGAATGTC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG GCTGCCGGAG TGAAAACCTA TAATAATGCT 
ATTGCACTCG ATAATGACAA GTCCAAAACA GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GGGAAATCAA CGGGTCTTGT AGCAACATCT GAAATAACAC ATGCAACCCC TGCTGCATAT 
GGCGCACATA ATGTTTCACG CAAAAATATG GCAGAAATCG CCGATGACTA TTTTGATGAT
CAAATCGACG GACAACACAA AGTCGATGTG TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
CGGAAAGATC GTGATTTAGT CAAAGAATTT TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
CTACCTAAAA TGATTGACCG AACGGAAGAA GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
ATTGATTGGG CCGGGCATAG CAATGATATT GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CAACTGCAGA TCATTCAACA GGGGGCTTGT CTT 

EF028-4 (SEQ ID NO:108)

QKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
DTYLVGQQAT YPEDEEENVT DSASAATAMA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
LQIIQQGACL 

EF029-1 (SEQ ID NO:109) 

TGAAGGAGGG AGAAAATGAA AAAGTTAATC GGTAAAAAGT GGCTGCTGCT TACAGCAGTA 
GCCACTTTTT TATTATCAGG ATGCGCAAGT CTTGAACAAA AAGCACAGGA TAGTGTAAAA 
GAAGTTACTG AAAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 
GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC 
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT 
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAATAA 

EF029-2 (SEQ ID NO: 110) 

MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 

EF029-3 (SEQ ID NO:111)

AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 

GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
CGGTTTTTAT TAATTGACAC ACCCGAGACT GTGAAACCGA AAACAAAAGT TCAGCCGTTC
GGATTGGAAG CTAGCAAACG CACAAAAGAG CTTTTGTCTA CTGCTTCAGA AATTACGTTT 
GAATATGATA AGGGCGATAA AACAGATCGT TACGGACGAG CGTTGGGCTA CATATTCGTA 
GATGGAACAT TACTACAAAA AACGCTTGTA AGTGAAGGAT TAGCTCGTGT TGCCTATGTA 
AAAGAGCCTA CAACTAAGTA TTTGGCAGAA CTAGAGCAAG CCCAAGAACA GGCTAAAAAT
GAGTCACTCG GAATCTGGAG CATACCAGGT TATGTGACAC AACGGGGGTT TAGTAAA 



EF029-4 (SEQ ID NO: 112) 

NVTQTIS NDQRIPADFV
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE
SLGIWSIPGY VTQRGFSK

EF030-1 (SEQ ID NO:113)

TGATTGACAC ATAGGGGGAA TAGTATGAAA AAGTTAAAAA TGATGGGGAT TATGTTATTT 
GTTAGTACGG TCTTGGTAGG TTGTGGCACA ACAGCAGANA CAAAAATAGA CGAGAAAGCA 
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT GATAGTGACC GA 

EF030-2 (SEQ ID NO: 114) 

MKK LKMMGIMLFV STVLVGCGTT AXTKIDEKAT EKTSVSKKVL NLMENSEIGS

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 

NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 

EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 

DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 

IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF 

VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SDR 

EF030-3 (SEQ ID NO: 115) 

GAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 



CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 

EF030-4 (SEQ ID NO: 116) 



EKAT EKTSVSKKVL NLMENSEIGS 

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF
VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDG 

EF031-1 (SEQ ID NO: 117) 

TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG AGGAAGATTT AAAAATGAAA 
AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATGACTGC TTGTGGAAAC 
AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA 

EF031-2 (SEQ ID NO: 118)

MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALVVSTF

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN VVKTYSKSSD
LANMFQSGEI EAAVVADFAV DIIQGAQKT



EF031-3 (SEQ ID NO:119) 

AA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAA 

EF031-4 (SEQ ID NO:120) 

TTKSESKG GSNALVVSTF

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN VVKTYSKSSD
LANMFQSGEI EAAVVADFAV DIIQGAQK



EF032-1 (SEQ ID NO:121) 

TGAATAAATT ATTTAGGAGG AATTATGATG AAAAAATTAA TTAGTTTAGG ATTGGTTTGT 
GTTTGTGGTA TTTCACTACT TACTGCTTGT NCGGGAAATA ATGATAATAA AGATACTGAA 
AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 
CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GGTAA 



EF032-2 (SEQ ID NO: 122) 

MK KLISLGLVCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS
IIGVIPTMDA KRW 



EF032-3 (SEQ ID NO: 123) 

TA ATGATAATAA AGATACTGAA
AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT
CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GG 



EF032-4 (SEQ ID NO: 124) 

NDNKDTEK STSQSSSTVK QPNSKDFVAS
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS
IIGVIPTMDA KRW



EF033-1 (SEQ ID NO: 125) 

TGACTGCTTT TTTTCTATTG GAGAAAAAAG TGGTTTTTTT GTATTGTTTT GACGTTGAGA
CAAAGGAGGT TCATTTCAGA AAATTTTCCC CAAAATAAAA TAGACGAATG CGAGGATGAA
AAAATGAAAA AATTTACTTT AACAATGATG ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA
GCAGGATGTG GTAAACAGGA AAAGAAAGCA ACTACCTCTT CTGAAAAAAC AGAAGTAACG
TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA
GAAATCGACG AGTAG



EF033-2 (SEQ ID NO:126) 
MKKFTLTMMT LGLVATLGLA 

GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 

AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 

VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE

EF033-3 (SEQ ID NO:127) 

CTCTT CTGAAAAAAC AGAAGTAACG 

TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 

ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 

ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 

GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 

AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 

ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA
GAAATCGACG AGTAG

EF033-4 (SEQ ID NO: 128)

SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 

AVDTQSSTMM TDLKKLPQMD MMAVDAEKLI ALKPQIVYVN DINLASSESV WKQVEDAGIT 

VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE

EF034-1 (SEQ ID NO:129) 

TAGGAGGGAG TAATCATGAA AAAAATCGGG TATTTTAGTT GTATTATTTT TTTCATGTTT 
TTGGTAGGTT GTAGTAATAA CAAAAAAGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 
CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-2 (SEQ ID NO: 130) 

MKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF
NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 

EF034-3 (SEQ ID NO: 131) 

AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCTTGAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-4 (SEQ ID NO: 132) 

KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF 

NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 

EF035-1 (SEQ ID NO:133) 

TAAACGAGAG GTGAGTTTAT GAAAACAAAA ATCGGAAAAA CAGTTATCTT GTCAGCATTT 
TTATTCACAA GTTTCCTTTT ACTGAGTGGT TGTACCTCGG CTGGCGAAGA GATGGAAAAA 
ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAGTA A 

EF035-2 (SEQ ID NO:134) 

MKTKI GKTVILSAFL FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 

EF035-3 (SEQ ID NO: 135)
GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAG 

EF035-4 (SEQ ID NO:136) 



MEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 



EF036-1 (SEQ ID NO:137) 

TAATTTTCAA GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT 
AATGAATCGA TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT ACTTACTTTG 
ACAGGATGTG CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 
TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA ATAA 

EF036-2 (SEQ ID NO: 138)

MKKRLLLF IGLASILTLT GCAKWIDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAVVGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 

EF036-3 (SEQ ID NO:139) 

GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 



GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT
TGGCAAGGAA ATGTCATTAA A 



EF036-4 (SEQ ID NO: 140) 

IDRGE SITAVGSSAL
QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI
DHKVAVVGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR
ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW
QGNVIK



EF037-1 (SEQ ID NO: 141) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAA 

EF037-2 (SEQ ID NO: 142) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF037-3 (SEQ ID NO: 143) 



AACAGATAG TAGTTCTAGT
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA A

EF037-4 (SEQ ID NO: 144)

TDSSSSS
KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN



EF038-1 (SEQ ID NO:145) 

TAATGGCCAT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 
TNGAATAAGA GGTGTCTTTG A 

EF038-2 (SEQ ID NO:146) 

M KLLKWRWQWN KDHKKGEVSM KKVLPFIALV GLLLLSGCGT DMKKILTADG 
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 

EF038-3 (SEQ ID NO: 147) 

TTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 

GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
GAATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 
TNGAATAAGA GGTGTCTTTG A 



EF038-4 (SEQ ID NO: 148) 

CGT DMKKILTADG
GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE
YKDGKLKGEI GGEKDSDKKX NKRCL



EF039-1 (SEQ ID NO:149) 

TAAATATATC AAAAAGAAAA AAGGGGATTA CCAACCATGA AAAAGAAAAA AGTTTTTAGT 
GCGCTTACCT TATTAACCTT TAGTACGTTG TTGATTGCAG GCTGTGCTGG CGGAGCCAAC 
TCTGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA

EF039-2 (SEQ ID NO:150) 

MKKKKVFSA LTLLTFSTLL IAGCAGGANS ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF039-3 (SEQ ID NO:151) 

TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATCCAG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CCTGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGGCAAAGA AATTAAAGTC 
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 

EF039-4 (SEQ ID NO: 152) 

ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANFI GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF040-1 (SEQ ID NO: 153) 

TAGATTAGAA CCACTGGAGA AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA 
ATGAACAAAA AAATTTTAAT GGGGCTATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG
TGTCAAGGAG GAGAAACACC TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
AAAATTTTAG CACAGTACAA CTAA

EF040-2 (SEQ ID NO:154) 

M NKKILMGLLS VVTIPLLAAC QGGETPSAAS KNSQTVTTQS

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 
DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 
TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 
LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 

EF040-3 (SEQ ID NO:155) 

AGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 

AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
AAAATTTTAG CACAGTACAA C 

EF040-4 (SEQ ID NO: 156) 

AS KNSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV 

DTEGNTPLNI AVHNNDIEIA KALIDRGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA 

TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ 

LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN 



EF041-1 (SEQ ID NO: 157)

TAATTATTAA NTTCTGATTT TTCAGAAAAT ACAGATTGCA TTATTTTAGG AGGCAACACT
ATGAAATTGA AAAAGTCATT AACATTCGGT GTGATTACAT TATTTAGCGT AACAACTTTA
GCGGCTTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT
AGGGGATGG



EF041-2 (SEQ ID NO: 158) 

M KLKKSLTFGV ITLFSVTTLA ACGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNVV KESPTALNLF QDGQTDDVVL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 



EF041-3 (SEQ ID NO:159) 

TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT 
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA 
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT 
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA 
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG 
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG 
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 



EF041-4 (SEQ ID NO:160) 

CGGGGTSDS SSASGGGKAS 
GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNVV KESPTALNLF QDGQTDDVVL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 

EF044-1 (SEQ ID NO:161) 

TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT AGTTTGTGTT 

ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 

ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 

ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 

GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 

GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 

TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAAT AA 

EF044-2 (SEQ ID NO: 162) 

MKKLVCVI LVIFLTGCSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 

MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 

IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

IQ 

EF044-3 (SEQ ID NO:163) 

TTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TTCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 
GTAATTCAA 



EF044-4 (SEQ ID NO: 164) 

CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYAVDLSSL NNPLVFNFKG 
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 
IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 
IQ 

EF045-1 (SEQ ID NO:165) 

TAGCCAAAAA ATGAGGGAGG AAAAGAGATG AACAAGAAAC GGATTTTAGG TGCAATCACG 
TTAGCTTCTG TGTTAGTATT CGGGTTAGCT GCATGTGGTG GCGGCAATAA AGGCGGGGGC 
AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAATAA 

EF045-2 (SEQ ID NO:166) 

MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA VVMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRVVDFT WAVDTKDNPW ATVGVTADSR 
K 

EF045-3 (SEQ ID NO: 167) 
ATGTGGTG GCGGCAATAA AGGCGGGGGC 

AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTGAGTCAG AAGAAAATAC TAAATTACTT GATGCGATTG ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAA 



EF045-4 (SEQ ID NO: 168) 

CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA VVMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRVVDFT WAVDTKDNPW ATVGVTADSR 



EF046-1 (SEQ ID NO:169) 

TAGGAGGATA TAATGAAAAA AAAACTTATT GTACTATTGT TAGCCTTATT TTTAACGGCA 
TGTAGTAATA ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT ATCAACTAAG 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT AGTAGTAATC CGGACACAAC ACCAACTTCT 
ACATCATCTA TAACAATTGA AACAACCGAG AATTTAAAGA ATAGAGAATT GAATCCAACA 
GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAACAG TTCAACAATT 
ACGGATAAAG AACTAAAGGA ATATATATCA GAGGCTAAAG AACAAAAGAA AGATGTCATT 
AATTATATTA AGCAAAAA 



EF046-2 (SEQ ID NO:170) 

MKKKLIV LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 
SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 



EF046-3 (SEQ ID NO:171) 
A 

TGTAGTAATA ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT ATCAACTAAG 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT AGTAGTAATC CGGACACAAC ACCAACTTCT 
ACATCATCTA TAACAATTGA AACAACCGAG AATTTAAAGA ATAGAGAATT GAATCCAACA 
GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAACAG TTCAACAATT 
ACGGATAAAG AACTAAAGGA ATATATATCA GAGGCTAAAG AACAAAAGAA AGATGTCATT 
AATTATATTA AGCAAAAA 

EF046-4 (SEQ ID NO: 172) 

C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 

SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF047-1 (SEQ ID NO:173) 

TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC TTATTTCTAG TGCTTTTCTT 
TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 
TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-2 (SEQ ID NO: 174) 

MKK IGLISSAFLL TLALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS 
SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLVVRASN 
INGESPDDLA KNVVNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSVVWQA GTVVYSVHHF 
DPIQAVKMAT SM 

EF047-3 (SEQ ID NO: 175) 

ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TCCGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 



EF047-4 (SEQ ID NO: 176) 

CGGG KSTENTDSRS SAAESTTVES TKASATKESS 

SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLVVRASN 
INGESPDDLA KNVVNILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSVVWQA GTVVYSVHHF 
DPIQAVKMAT SM 



EF048-1 (SEQ ID NO:177) 

TAAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA 
GGATTACAGT TGCTAAGTGG TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 
TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-2 (SEQ ID NO: 178) 

MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTVVL RYAYASNSQP VIDSMKKFGE 
LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 

EF048-3 (SEQ ID NO: 179) 

TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 

TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATGGTAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-4 (SEQ ID NO:180) 

CGKT EASANDTVVL RYAYASNSQP VIDSMKKFGE 

LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 



EF049-1 (SEQ ID NO:181) 

TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA 
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGGCGCAATG 
GCCGTTTTTA GTTTAGCAGC GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-2 (SEQ ID NO:182) 

MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 

EF049-3 (SEQ ID NO:183) 

GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 

TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-4 (SEQ ID NO: 184) 

CSSG SKDIATMKGS 
TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD KXIQKNFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL KTAWASFHPE VEAQIIQVAS 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMKPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 

EF050-1 (SEQ ID NO:185) 

TAGGGTCTGG AAAAGCAGTC AACTGACTTC TTTTCCAAGC CCTTTTTTAG TTCATCGCAG 
AAAGGATGNA AAAAAATGAA CATGCCCAAA AATATCNGTT ATTTTTCTTT GCTAATGGGT 
CTTGTTCTAT TATTAAGTGC TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 



EF050-2 (SEQ ID NO:186) 



MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTVVKERG 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINVV 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF050-3 (SEQ ID NO: 187) 

TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 

GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTCAT 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA AGATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA ATAG 

EF050-4 (SEQ ID NO: 188) 

CQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTVVKERG 
ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 
LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINVV 
PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF051-1 (SEQ ID NO: 189) 

TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT GCTTAGTTTA 
TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 
ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-2 (SEQ ID NO:190) 

MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK 
IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSVV 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 

EF051-3 (SEQ ID NO:191) 

ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 

ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-4 (SEQ ID NO: 192) 

CT NQPQKETVST KKEEITLAAA ASLESVMEKK 

IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSVV 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 



EF052-1 (SEQ ID NO: 193) 

TAAAGTAGGA GAAGCGCAAG CGAAAAAAGT GAATCAATCG GCAGCGTATC AAGTAGTGAT 
CCCACAATGG GTACCATGGG TAGCATTATC TTTGACAGTA GCACTTGCTG GATTGATTGC 
TTACTTAGTT CGTCGTGGAG AGAAGTGGAA AAACGAAGGG GAAGTGACAT AATGAGANGA 
NGAAATCTTC NGTTTTTATT ATTGTTGGTT CTATTAATTT ATATTCCTCA AACAACTTAT 
GCAGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT AAAAAGATTA CCTAAAACTG GTGACAATCG AATAACTTGG 
CTAAGCTGGT TTGGCATATT GTTTTTAATA AGTAGTTTTT GGCTGTTTCT ATTTAGACAA 
TTATGTAGAA AAGGAGAATA A 

EF052-2 (SEQ ID NO: 194) 

MRXX 

NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTKTSDIPS KKNPVVNVLP QTTIQSLSIV 
RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE 



EF052-3 (SEQ ID NO: 195) 

AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGGTTTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT AAAAAGAT 

EF052-4 (SEQ ID NO:196) 

ENRETTEVGI GFTKTSDIPS KKNPVVNVLP QTTIQSLSIV 
RSRTQIKR 



EF053-1 (SEQ ID NO:197) 



TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGATGA AAAAATACCT TTTGCTTAGT 
TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCAGACACTG CGTTTGGAGA AGCAGCTTAT 
GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGATTACC AGCGACAGGT 
ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC TTATCACTAT AGGCATATTA 
TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA GCAGTAGTAG GGATTGTAGG 
ATTTAG 

EF053-2 (SEQ ID NO:198) 

MKKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGVVSFYGT YEYPTEESTT 

ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF 

IKRRREDEKN SISSSRDCRI 

EF053-3 (SEQ ID NO:199) 

TTTGGAGA AGCAGCTTAT 

GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGA 

EF053-4 (SEQ ID NO:200) 

FGEAAYE NSGVVSFYGT YEYPTEESTT 
ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR 



EF054-1 (SEQ ID NO:201) 

TAAATAAAAA ATTATTTGGA GGAAATTACA ATGAAAAAAA TTATTTTATC AAGCTTGTTT 
AGTGCAGTAC TAGTATTCGG TGGCGGAAGT ATAACAGCAT TCGCTGACGA TTTAGGACCA 
ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
TTGCCACATA CAGGAGAGAA ATTCACACTC CTTTTCTCTG TATTGGGAAG CTTCTTTGTA 
TTAATTTCAG GATTCTTTTT CTTTAAAAAG AATAAGAAAA AAGCTTAA 

EF054-2 (SEQ ID NO:202) 

M KKIILSSLFS AVLVFGGGSI TAFADDLGPT DPATPPITEP TDSSEPTNPT 
EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVLGSFFVL ISGFFFFKKN KKKA 



EF054-3 (SEQ ID NO:203) 
A 

ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC ACCGAGTGAG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
T 



EF054-4 (SEQ ID NO:204) 



DDLGPT DPATPPITEP TDSSEPTNPT 

EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKV 



EF055-1 (SEQ ID NO:205) 

TAACAAAAGG TTGTTTTGTC TTTCTTGTGT AAAAGGGCAA GAAAGGCTAG CGAGTTAAAA 
GGAGGTTTTT CAATGAAAAA AAAGCGTTAT TTAATGATTG TGTGTCTACT ATCTTCTCCT 
AGTTTTTTTA TAAATGTTGA AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 
TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACGCTCCCT CGTACAGGGA GCAAGAGTCA GGCAAATTTG 
AGCATTCTCN GNTTCGCCTT AATCGGTTTG GCGGGAATCG TACATAGAAA GAAGGGACGA 
CATGAAGCAA ACTAA 

EF055-2 (SEQ ID NO:206) 

MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGSTLPR TGSKSQANLS 
ILXFALIGLA GIVHRKKGRH EAN 

EF055-3 (SEQ ID NO:207) 



AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACG 



EF055-4 (SEQ ID NO:208) 



SDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGST 

EF056-1 (SEQ ID NO:209) 

TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC TATTTTCCCC TAGTTTTTTT 
ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC AGGCAAACCT GAGCATTCTT 
GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA AGAAGGGACG ACATGAAGCA 
AACTAA 



EF056-2 (SEQ ID NO:210) 

MKKKRYLIIA CLLFSPSFFI NVEASEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQLPRTGS KSQANLSILG LVLIGLVGMV QRKKGRHEAN 

EF056-3 (SEQ ID NO:211) 

ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 

CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAG 

EF056-4 (SEQ ID NO:212) 

SEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQ 



EF057-1 (SEQ ID NO:213) 



TAATGTTTAT TGGCTGGGCC AGTCAATGTT GAAAATGGGG AAGGAGGAAT TCAGATGAAA 
ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT GCNAACAAAA 
AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 
TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC AGGCGAGTCT 
GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT TTATTTAATT 
AATAAAATGA AACGAGAGAA GGAGTTTATT TAA 

EF057-2 (SEQ ID NO:214) 

MKI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN 
KMKREKEFI 



EF057-3 (SEQ ID NO:215) 

AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT 

EF057-4 (SEQ ID NO:216) 

EENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 

EF058-1 (SEQ ID NO:217) 

TGAAGAACGT TCTATTTGGT TGACGATTGC AGGCCTGCTA ATCATTGGGA TGGTAGTCAT 
TTGGCTATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA TGAAGCAATT AAAAAAAGTT 
TGGTACACCG TTAGTACCTT GTTACTAATT TTGCCACTTT TCACAAGTGT ATTAGGGACA 
ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 
ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA TCTACGTTTA CTTAGGAAGT 
GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA GACGTAGAAA AGAAAATGCT 
TAA 

EF058-2 (SEQ ID NO:218) 
MKQLKKVW YTVSTLLLIL PLFTSVLGTT 

TAFAEENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMVVAF PVYEMIKQTD GSYKYGTEEL AVVHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE VVTGGKRFIK VDGDVTATQA LAGASFVVRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFVV 
NEQSYGTTEN LVSPEKVPNK HKGTLPSTGG KGIYVYLGSG AVLLLIAGVY FARRRKENA 



EF058-3 (SEQ ID NO:219) 

AGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 

ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC T 

EF058-4 (SEQ ID NO:220) 

EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMVVAF PVYEMIKQTD GSYKYGTEEL AVVHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE VVTGGKRFIK VDGDVTATQA LAGASFVVRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFVV 
NEQSYGTTEN LVSPEKVPNK HKGT 



EF059-1 (SEQ ID NO:221) 

TAGATTGGAA GAATGAAAAT GAAAAAAATG ATTATTATTG CCTTATTCAG TACAAGCCTT 
TTAGCAGGGG GAAGCAGTGT TTCTGCTTAT GCGCAAGAAT CAGAAGGAAA TCTTGGTGAA 
ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTACTTCC TTACACTGGT 
GAAAAAATGG GCATAATTGG GTCAATCGCT GGTGTATGTT TGACTGTTTT ATCAGGAATC 
TTAATTTATA AAAAACGTAA AGTGTAG 

EF059-2 (SEQ ID NO:222) 

MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP EKPVVPQQPE QPTDVVVKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKVLPYTGE KMGIIGSIAG VCLTVLSGIL IYKKRKV 

EF059-3 (SEQ ID NO:223) 

AGAAGGAAA TCTTGGTGAA 

ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA CCAGAAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTAC 

EF059-4 (SEQ ID NO:224) 

EGNLGET TGSVLPDEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP EKPVVPQQPE QPTDVVVKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKV 

EF060-1 (SEQ ID NO:225) 

TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG 
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT 
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT 
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA 
AAATAA 

EF060-2 (SEQ ID NO:226) 

MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLK 
TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA 
CLLVVLTSFY YLNKKRKKEK 

EF060-3 (SEQ ID NO:227) 

AGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT 

EF060-4 (SEQ ID NO:228) 

EEVGQTNIGV TFYGGKEPLK 
TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS 

EF061-1 (SEQ ID NO:229) 



TAATGGAACG ACCGACAGAA GAAGATTTTG AACTTACAAA TTAAAATTAA AATGGAGGAA 
ATAATGATGA AAAAAATTCT TTTTGCTAGT TTATTTAGTG CCACACTACT ATTTGGGGGA 
AGTGAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TTTACCATAC 
ACAGGTGAAG AAATGAATAT CTTTTTATCT GCCGTAGCGG TATCTTGTCT GTAG 

EF061-2 (SEQ ID NO:230) 

MMKKILFASL FSATLLFGGS EISAFAQEII PDDTTTPPIE 
VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVVVTP TGELNHAGNG 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
PSGNVEVKGK DGKMKVLPYT GEEMNIFLSA VAVSCL 




EF061-3 (SEQ ID NO:231) 

GAAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT 

EF061-4 (SEQ ID NO:232) 

QEII PDDTTTPPIE 
VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVVVTP TGELNHAGNG 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
PSGNVEVKGK DGKMKV 



EF062-1 (SEQ ID NO:233) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF062-2 (SEQ ID NO:234) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
VVEKASVVPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF062-3 (SEQ ID NO:235) 



TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TG 



EF062-4 (SEQ ID NO:236) 

AELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
VVEKASV 

EF063-1 (SEQ ID NO:237) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF063-2 (SEQ ID NO:238) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
VVEKASVVPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF063-3 (SEQ ID NO:239) 

GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 

AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTG 



EF063-4 (SEQ ID NO:240) 
ELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK V 



EF064-1 (SEQ ID NO:241) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGCC CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATCTTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAAC CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF064-2 (SEQ ID NO:242) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TEQSSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
ITVVEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQQ AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFDQSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANLGS NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRDTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
VVEKASVVPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF064-3 (SEQ ID NO:243) 



AGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTTG GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF064-4 (SEQ ID NO:244) 



VTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN ADGSVVLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT VVTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTNTVVNHI PKVXPKKDVV 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGVVEEW SISDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG VVKITASQAF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP EKTVIVPPTP KTPQAPVEPL 
VVEKASV 



EF065-1 (SEQ ID NO:245) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCGAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF065-2 (SEQ ID NO:246) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF065-3 (SEQ ID NO:247) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF065-4 (SEQ ID NO:248) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 

EF066-1 (SEQ ID NO:249) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF066-2 (SEQ ID NO:250) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF066-3 (SEQ ID NO:251) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCA 

EF066-4 (SEQ ID NO:252) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVT 

EF067-1 (SEQ ID NO:253) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCGAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF067-2 (SEQ ID NO:254) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF067-3 (SEQ ID NO:255) 

GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 

GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACGACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF067-4 (SEQ ID NO:256) 

VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF068-1 (SEQ ID NO:257)

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF068-2 (SEQ ID NO:258) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LVVPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TVVQTDLLDV NLLATADGVS
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT
LASGKATAKQ TVNVVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE SQPTEVTVPE
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV
TPGETITIIS KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN

EF068-3 (SEQ ID NO:259)

CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC 



EF068-4 (SEQ ID NO:260) 

TSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LVVPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TVVQTDLLDV NLLATADGVS
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTP 

EF069-1 (SEQ ID NO:261) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF069-2 (SEQ ID NO:262) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LVVPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TVVQTDLLDV NLLATADGVS
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT
LASGKATAKQ TVNVVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE SQPTEVTVPE
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV
TPGETITIIS KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 



EF069-3 (SEQ ID NO:263)

AGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA

GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 

AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 

GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 

GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA 

ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 

CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 

GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 

GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 

ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 

GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 

ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC

ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 

GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 

AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 

TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 

ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 

GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 

ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 

ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA

ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 

AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 

GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 

ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 

GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAA 



EF069-4 (SEQ ID NO:264) 

AGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT
LASGKATAKQ TVNVVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE SQPTEVTVPE
DATVAAPTVT TVTGT 



EF070-1 (SEQ ID NO:265) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT
CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGGCTAT
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGACTTT AGATTCAGGA
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG

EF070-2 (SEQ ID NO:266) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LVVPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TVVQTDLLDV NLLATADGVS
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSVV AKNASGTESQ
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NLPAGTANAN ETLTALAKDP AGNTSTPTTF 
QTPADEVVAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT
LASGKATAKQ TVNVVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT VVGKDGDGNE SQPTEVTVPE
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV
TPGETITIIS KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 

EF070-3 (SEQ ID NO:267) 

CGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTACC AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAGG AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTCGG TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACT 



EF070-4 (SEQ ID NO:268)

DGDGNE SQPTEVTVPE 

DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQVVGAAEVG
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT VVAKNATGKE SQPATATTPV
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGVV
TPGETITIIS KDGAGNESQP ATAVIPADVV LAAPTITKVE GNKANGYTVT GTADPNVTVQ
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YL 



EF071-1 (SEQ ID NO:269) 

TAAGTAGAAG TGGTCGGGAC AAACGTAGAA CTTTCGCTGA TTGCCGAAGA AATTACTTCT
GTCCCGCCAT TTATCTGCAG GTTTAAGCCG TGGAAGGGAA GTTATTTTGA CTTTCCTTTC
ATGGCTTTTT TAAGAAAGGA GCATGCTATG TTTAAAAAAT TAATGATTCA ACTTGCTTTA
GTGATTGGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG CTTACACCAT CGAAGCGGAT
CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGTA A 



EF071-2 (SEQ ID NO:270) 
MF KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
TGVSXTGETG HYSAR 

EF071-3 (SEQ ID NO:271) 

G TTTAAAAAAT TAATGATTCA ACTTGCTTTA 

GTGATTGGTT TAAGTTTAAC GATTCCGATG ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
CCAATCAACT TTACTTATTT TCCCGGCTCT GCAAGCAATG AATTAATTGT TTTACATGAA 
TCTGGAAACG AGCGGAACCT AGGACCACAC AGTTTAGACA ATGAAGTGGC CTATATGAAA 
CGAAATTGGT CAAATGCTTA TGTCTCATAT TTTGTCGGAT CTGGTGGACG AGTGAAACAA 
TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAATCA AAAAGCCTAT 
GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAAGA TTATGCTGCC 
TATGTTAATT TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTTTTC TCTGGACGAT 
GGAACAGGTT ATGGCATAGT CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 
ACAGATCCTT ATGGTTATTT AGCGCGTGGG GGATTAGTAA AGCGCATTGG CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGGT 

EF071-4 (SEQ ID NO:272) 

F KKLMIQLALV 

IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARGG LVKRIGTRFT 
TGVSXTGETG HYSAR 

EF072-1 (SEQ ID NO:273) 

TAATCAATGA AAAACGCACG TTGGTTAAGT ATTTGCGTCA TGCTACTCGC TCTTTTCGGG 
TTTTCACAGC AAGCATTAGC AGAGGCATCG CAAGCAAGCG TTCAAGTTAC GTTGCACAAA 
TTATTGTTCC CTGATGGTCA ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTCA TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT AATCAATAA 



EF072-2 (SEQ ID NO:274) 

MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE 
DGVVSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLVV ILPVQDPQGQ SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTCN Q 

EF072-3 (SEQ ID NO:275) 

ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 

CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTCA TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAACGTT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATCA AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT A 

EF072-4 (SEQ ID NO:276) 

QLPEQ QQNTGEEGTL 

LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE 
DGVVSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLVV ILPVQDPQGQ SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMRL EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTC 

EF073-1 (SEQ ID NO:277) 

TAAATGAACA AATTAAATAC AAAATTACTG ATTGGCTATA TTCTTTTAGG AGCCTTAATC 
ATTGCTGTCG CTAGAGAATA TGGCTTCTTC GCTTTTGTGA TTCTGGTAGG CTTTTTAGTA 
TTCGTTCTCT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA ATAA 

EF073-2 (SEQ ID NO:278) 

MNKLNTKLLI GYILLGALII AVAREYGFFA FVILVGFLVF VLYRKKKNAA DKSDQMPYLT 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ 

EF073-3 (SEQ ID NO:279) 

CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAAT GCCTTACTTA 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AATTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATCGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TGACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CGATTTAGAT 
GTCGAAATGT CGATCGCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA AT 



EF073-4 (SEQ ID NO:280) 

YRKKKNAA DKSDQMPYLT 
KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ IIQLQENMNR STKLRAIDLR NDTTKVSKAL 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KHLEIEQHEV KNKQTYEKLE ESAQIIDQLS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ 

EF074-1 (SEQ ID NO:281) 



TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA AAAATAATTC CTGCTTTTCC CCTTCTTTCA 
ACCGTTGCAG TTGGCTTGTG GTTAACGCCT ACTCAAGCTT CTGCAGATGC TGCGGATACG 
ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCTATN ATATT 

EF074-2 (SEQ ID NO:282) 

MKLKK IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNVVPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LYXI 

EF074-3 (SEQ ID NO:283) 

TGC TGCGGATACG 

ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAACTG GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAACCA AACCGTCATC 
CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 
ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 
TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 
GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCT 



EF074-4 (SEQ ID NO:284) 
AADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNVVPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEII RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LY 



EF075-1 (SEQ ID NO:285) 

TAACCTATAA GAAAAAAATC ACAACCTGTG ATAAATTATT GGAGGNAAAA TATGTCAAAA 
GGGAAGAAAA TTTTTGCCAT TATCNTTGGA ATTATCTTGG NTCTATTTCT TGCAGTTGTT 
GGAATGGGAG CAAAACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 
TAA 

EF075-2 (SEQ ID NO:286) 

MSKG KKIFAIIXGI ILXLFLAVVG MGAKLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



EF075-3 (SEQ ID NO:287) 

ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA 

GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCGTTC GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATCACGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATACC TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGTTGG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGATTGG TAAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGACCC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTCTGATAA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTGC CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTTC CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 



EF075-4 (SEQ ID NO:288) 
KLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR VEQGRSDTTI VATVNPRDKQ TTLVSLARDT 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGLKE LVNAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI EGIVQKVLSL 
NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 
DEQELTRVQQ ELKNQLNTK 



EF076-1 (SEQ ID NO:289) 

TAGAAAATAA CAGAGGAGCT GAAGGAAATG AAAGCATCAA CAAAAATTGG TATCGGTTTA 
AGCATTGCTG CAGTTGCAAG TGTCTCTGTT GCAGTCATCG CTTCTGAAAA AATTATTAAG 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT AGATTCTGTT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTTT TGTTTCAACA 
TAA 

EF076-2 (SEQ ID NO:290) 

MK ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NVVDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNSFVST 

EF076-3 (SEQ ID NO:291) 

CATCG CTTCTGAAAA AATTATTAAG 

AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT AGATTCTGTT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTT 



EF076-4 (SEQ ID NO:292) 

VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NVVDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNS 



EF077-1 (SEQ ID NO:293) 

TAATGTAAAG TGAATGATGG GAGAGAAAAA GAGATGAAGC ATGTAACAAA ATTGGGGATT 
ACAATTATAA CAGGAGTTTT GGCATTATTA TTTGAATTTA TTTTACATCA GCCGAATTGG 
GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCCAGTAA 

EF077-2 (SEQ ID NO:294) 

MKHVTKLGIT IITGVLALLF EFILHQPNWA YGIILITGSV MALMMFWEMI 
QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELVV KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
FTLVAYLIAG VAWFVSKSPT RFAEVLVVAS PCPLILSAPI ALVAGMGRSS RHGVVIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADVVILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEVVDTVSI 
LSALRARRIG Q 

EF077-3 (SEQ ID NO:295) 

TCA GCCGAATTGG 

GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCTGACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC 

EF077-4 (SEQ ID NO:296) 

QPNWA YGIILITGSV MALMMFWEMI 

QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELVV KPGELVPVDG LVKTGTSTVD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAHF VRLADRYAVP 
FTLVAYLIAG VAWFVSKSPT RFAEVLVVAS PCPLILSAPI ALVAGMGRSS RHGVVIKSGT 
MVEKLASAKT IAFDKTGTIT QGQLSVDQVQ PINAGITAAE LVGLAASVEQ ESSHILARSI 
VAYARKQDVP LKNITDLAEV SGAGVKAFVD GAEIRVGKKN FVTQESQETE KIDKTTIHIS 
RNGTYLGRIT FTDTVRPEAK ETMEKLHQLH LQRILMLTGD QESVAETIAA EVGITEVHGE 
CLPQDKLTIL KELPKENHPV IMVGDGVNDA PSLAAADVGI AMGAHGATAA SETADVVILK 
DDLSKVSQAV EIAQDTMKIA KQSVLIGIFI CVLLMLIAST GIIPALIGAM LQEVVDTVSI 
LSALRARRIG 

EF079-1 (SEQ ID NO:297) 

TAATTTCTAG CATCACCGAA GAAATTTTTA GAAAAACAAA GAGCCTGGGC CAATCACTGT 
CCCAGGCTCT CATGCTTTAT TTTTAAGGAG GAAGCAATGA AGTCAAAAAA GAAACGTCGT 
ATCATTGATG GTTTTATGAT TCTTTTACTG ATTATTGGAA TAGGTGCATT TGCGTATCCT 
TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTATCGCTCA TTATCAAGCA 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
AAAACAACGA AAAAACCAGA CAAATCCTAT TTTGAAAGTC ATACGATTGG TGTTTTAACC 
ATTCCAAAAA TAAATGTCCG TTTACCAATT TTTGATAAAA CGAATGCATT GCTATTGGAA 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC TATCCTACAG GTGGTACGAA TACACATGCG 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA GCCAAATTAT TTACAGATTT GCCAGAATTA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
CAAATAAAAA CCGTTGAACC AACTGATACA AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
CTCGTCACTT TATTAACTTG CACACCGTAT ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GGACATCGTA TCCCATATCA ACCAGAAAAA GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
CAACAAAATT TACTATTATG GACATTACTT TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
TTCATTATCT GGTACAAGCG ACGGAAAAAG ACGACCAGAA AACCAAAGTA G 

EF079-2 (SEQ ID NO:298) 

MKSKKKRRI IDGFMILLLI IGIGAFAYPF 
VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALIISGF 
IIWYKRRKKT TRKPK 

EF079-3 (SEQ ID NO:299) 

TCCT 

TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTATCGCTCA TTATCAAGCA 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTC AAGAAAAAAT GGAAAAGAAA 
AACCAAGAAT TAGCGAAAAA AGGCAGCAAT CCTGGATTAG ATCCTTTTTC TGAAACGCAA 
AAAACAACGA AAAAACCAGA CAAATCCTAT TTTGAAAGTC ATACGATTGG TGTTTTAACC 
ATTCCAAAAA TAAATGTCCG TTTACCAATT TTTGATAAAA CGAATGCATT GCTATTGGAA 
AAAGGAAGCT CCTTGTTAGA AGGAACCTCC TATCCTACAG GTGGTACGAA TACACATGCG 
GTCATTTCAG GCCATCGTGG TCTCCCTCAA GCCAAATTAT TTACAGATTT GCCAGAATTA 
AAAAAAGGCG ATGAATTTTA TATCGAAGTC AATGGGAAGA CGCTTGCTTA TCAAGTAGAT 
CAAATAAAAA CCGTTGAACC AACTGATACA AAAGATTTAC ACATTGAGTC TGGCCAAGAT 
CTCGTCACTT TATTAACTTG CACACCGTAT ATGATAAACA GTCATCGGTT ATTAGTTCGA 
GGACATCGTA TCCCATATCA ACCAGAAAAA GCAGCAGCGG GGATGAAAAA AGTGGCACAA 
CAACAAAATT TACTATTATG GACATTACTT TTAATTGCCT GTGCGTTAAT TATTAGCGGC 
TTCATTATCT GGTACAAGCG ACGGAAAAAG ACGACCAGAA AACCAA 

EF079-4 (SEQ ID NO:300) 

PF 
VSDALNNYLD QQIIAHYQAK ASQENTKEMA ELQEKMEKKN QELAKKGSNP GLDPFSETQK 
TTKKPDKSYF ESHTIGVLTI PKINVRLPIF DKTNALLLEK GSSLLEGTSY PTGGTNTHAV 
ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKTVEPTDTK DLHIESGQDL 
VTLLTCTPYM INSHRLLVRG HRIPYQPEKA AAGMKKVAQQ QNLLLWTLLL IACALIISGF 
IIWYKRRKKT TRKP 

EF080-1 (SEQ ID NO:301) 

TAGTTACACT CGTTTAGGGC TAGCAACGTT AGGCATTTTC GCTGGACTCT TAGCACTCTT 
TTTATTAGGA GGTTATTTCC TATGAAAAAA CGACTTTTAC CTATTTTTTT CCTAATACTT 
CTTACCTTTG GCCTTGCCCT ACCCGTTTCG GCGGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGTTTTTA G 

EF080-2 (SEQ ID NO:302) 

MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ 
LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRSF 



EF080-3 (SEQ ID NO:303) 

GGCTGAAA ATTCAATTGA TGATGGCGCA 
CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG AGATACAACC TTTAGAAGAA 
AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 
GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG ACCAAAATGC GATTCTTTTT 
CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CTGGAAACAT GATTGATTAT 
ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 
AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTC AAGCATTTGT TAATAAAGGG 
GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 
ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC TGATACTCAG TTTGGTCTTC 
TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GTTATCAATA TCCCTTTAGG 
GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT TAACCAACTC TTTCATCACT 
ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA TGGGCGGTGG TGGTAGCACC 
ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC GAAGT 

EF080-4 (SEQ ID NO:304) 



LLTPDQINQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKDQNAILFL 
IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFVQ ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISKYQLK FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMGGGGSTT HSTGGGTFGG GGRS 

WO 98/50554 
PCT/US98/08959 

TABLE 1. Nucleotide and Amino Acid Sequences of E. faecalis Genes. 

EF081-1 (SEQ ID NO:305) 

TGAATGGAAC GAAGCAATCG TAATAAAAAA TCTTCAAAAA AACCACTTAT TCTTGGTGTT 
TCTGCCTTGG TTCTAATCGC TGCTGCCGGT GGCGGGTATT ATGCTTATAG TCAATGGCAA 
GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGGAAGCAN TGCGCAATTA A 

EF081-2 (SEQ ID NO:306) 

MERSNRNKKS SKKPLILGVS ALVLIAAAGG GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
QEFDKLPSVV QEASLKKNGY DTKSVVEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGVVPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG EAXRN 

EF081-3 (SEQ ID NO:307) 

T GGCGGGTATT ATGCTTATAG TCAATGGCAA 

GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TGATAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGG 

EF081-4 (SEQ ID NO:308) 

G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
QEFDKLPSVV QEASLKKNGY DTKSVVEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGVVPG KLGSGAEKTA NIKAFSDKFG VSVDEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG 

EF082-1 (SEQ ID NO:309) 

TAAAAAATGA AAAAGATCGT GCGCATTTCA AGCATTTTGT TCGTTGCTAC GCCTCTTATG 
CTTTTAAATA GTTCAAAAGT TGAAGCAGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 
ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT TTAA 

EF082-2 (SEQ ID NO:310) 

MKKIVRISS ILFVATPLML LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTPL 

EF082-3 (SEQ ID NO:311) 

AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 

ATTACGTTTG CTCTTGATAA TACTGTCACG CCACCTGTCA ACCCGACGAA CCCTTCTCAG 
CCTGTGACAC CTAATCCTGC TGATCCTCAT CAACCTGGTA CAGCCGGACC CCTTAGTATT 
GACTATGTTT CAAATATCCA TTTTGGATCA AAACAAATTC AAGCCGGAAC AGCGATCTAT 
TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTGGCGATT TAATTAGCGT GCCAAACTAT 
GTTCAAGTAA CTGACAAACG TGGTCTAAAT CTTGGCTGGA AATTATCAGT TAAACAGAGT 
GCGCAATTTG CTACAAGTGA TTCAACACCC GCTGTTTTGG ATAATGCATC CTTGACCTTT 
TTAGCAGCAA CACCCAATTC AACACAGTTA CTTTCTTTGG CGCCATTAAC GGTCCCAGTA 
ACCTTGGATC CAACTGGTGC CGCCACTTCT CCTGTGGCGA CTGCCGCTCT TTCAACAGGA 
ATGGGCACTT GGACATTAGC TTTTGGTAGC GGANCGACCG CTGCTCAAGG CATTCAATTA 
ACTGTTCCTG CGACAACGAA AAAAGTTGCA GCTAAACAAT ATAAAACAAC GCTTACTTGG 
ATTTTGGATG ATACACCACT 

EF082-4 (SEQ ID NO:312) 

AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP 

VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI 
LDDTP 

EF083-1 (SEQ ID NO:313) 

TAATTTAAAA GACAAGGAGA AATAAAAATG AAAAAGAAAA TTTTAGCAGG AGCGCTTGTC 
GCTCTGTTTT TTATGCCTAC AGCTATGTTT GCCGCAAAAG GAGACCAAGG TGTGGATTGG 
GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGGGA 
AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CTAGTGTTCC AGATGGATAT 
GGAGGATATG TAAGTTCAGA TGCCGAAAAA GCAGCAAATA CAGAGACAAT TTTGTACGGT 
ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT ACAGCTATAA GCCATTTACA 
CTAAATCATG TAAACTATCA ACAAATCATC AAAGAGTTTC CTAACTCTTT ATGGATTGCT 
GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG CTTATTTCCC AAGCATGGAT 
GGTATTGGTA TTTGGCAATT CACATCCGCT TATATTGCAG GTGGTTTAGA TGGTAACGTA 
GATTTAACAG GAATTACGGA TAGTGGTTAT ACAGATACCA ATAAACCAGA AACGGATACG 
CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAATACCTA ATTCTGATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 
TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GAGAAACATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT GGCTAATCCA 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AATTATTAA 

EF083-2 (SEQ ID NO:314) 

MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG YAHDKFAIAQ 
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY FLPRIQTPKN 
SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RRIKQAGYTP MYYSYKPFTL 
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY IAGGLDGNVD 
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW 
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP DKQPEATHVV QYGETLSSIA 
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI AAKLGTTYQA 
LAALNGLANP NLIYPGQTLN Y 

EF083-3 (SEQ ID NO:315) 

AAAAG GAGACCAAGG TGTGGATTGG 

GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT ACACATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTGGTATGA CACTTGGGGA 
AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACGTATTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CTAGTGTTCC AGATGGATAT 
GGAGGATATG TAAGTTCAGA TGCCGAAAAA GCAGCAAATA CAGAGACAAT TTTGTACGGT 
ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT ACAGCTATAA GCCATTTACA 
CTAAATCATG TAAACTATCA ACAAATCATC AAAGAGTTTC CTAACTCTTT ATGGATTGCT 
GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG CTTATTTCCC AAGCATGGAT 
GGTATTGGTA TTTGGCAATT CACATCCGCT TATATTGCAG GTGGTTTAGA TGGTAACGTA 
GATTTAACAG GAATTACGGA TAGTGGTTAT ACAGATACCA ATAAACCAGA AACGGATACG 
CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAATACCTA ATTCTGATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CTGGGGAAGC TATTCCGCAA 
TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAAGCAGAGT ATTGCTTGAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT TGCCAGACGC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GAGAAACATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT GGCTAATCCA 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGGCAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATTAGCAAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AAT 
EF083-4 (SEQ ID NO:316)

KGDQGVDWA IYQGEQGRFG YAHDKFAIAQ
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN MDIAKTTMDY FLPRIQTPKN
SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RRIKQAGYTP MYYSYKPFTL
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY IAGGLDGNVD
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA WATGEAIPQW
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP DKQPEATHVV QYGETLSSIA
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI AAKLGTTYQA
LAALNGLANP NLIYPGQTLN 

EF084-1 (SEQ ID NO:317)

TAGTCAAACG TTTATTTTTT CCTTAAATCC AGAAAAAATC CCGTAATTAT GGTACACTAC 

Sat^aItt ggaggagaac tatgaagaaa tttgatgtaa ttattgtcgg tgctgggacg 
aSggtatga tggccacgat tgcggccgcc gaagcaggcg ctcaagtatt attgattgaa 
AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC

i^TCGGC cScISaGA AATCATTTCA TTTATTCCTG GGAATGGAAA ATTTTTATAC 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 
CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC GAAGTCAATT

StgI^c tatttaaccg cattaacgaa ttaggagtca ctgtttttac aaaaacacag 
GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA
ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA
ACTGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT
ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC
TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT
CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT
AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG
GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT
ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC
GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TCTAA

EF084-2 (SEQ ID NO:318)

.Sg™^™ AFSQFDNYDI = SNGIH 

LKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQI IGVETELEKI 
YAPCVVLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV

EF084-3 (SEQ ID NO: 319) 

r fJAAGCAGGCG CTCAAGTATT ATTGATTGAA 

AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG TAATGTAACC 
^TCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG ^AATGGAAA ^TTTATAC 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTTTTGAATC CAATGGTATT 
CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC GAAGTCAATT
t^Z^C TATTTAACCG CATTAACGAA TTAGGAGTCA CTGTTTTTAC AAAAACACAG 
GTCACAAAAT TACTACGAAA AGACGATCAA ATAATTGGCG TTGAAACCGA ACTGGAAAAA
ATTTATGCAC CGTGTGTTGT ATTAACAACT GGCGGCCGCA CTTATCCTTC CACAGGAGCA 
ACTGGTGATG GCTATAAACT AGCCAAAAAA ATGGGGCATA CCATCAGCCC GCTCTACCCT
ACCGAATCAC CTATTATTTC TGAAGAACCT TTTATCCTGG ATAAAACGTT GCAAGGTCTC 
TCTTTACAAG ATGTTAATTT AACTGTTTTG AACCAAAAAG GAAAACCTTT AGTTAATCAT 
CAAATGGATA TGCTGTTTAC ACATTTTGGC ATTTCAGGAC CTGCCGCGCT CCGCTGTTCT 
AGTTTTATTA ACCAAGAATT AACTCGCAAC GGTAATCAAC CTGTCACGGT AGCCTTGGAT 
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG CCTGCCAAAC AACTAACAGA AAAGCAACGN 
CTTTCCTTTG TGGAACTACT GAAAGACTTT CAGTTCACTG TTACGAAAAC ATTGCCTTTG 
GAAAAATCTT TTGTCACAGG CGGTGGGATT TCCCTCAAAG AAGTGACCCC TAAAACAATG 
GAGAGCAAAT TAGTCAATGG TTTATTTTTT GCTGGTGAAC TTTTAGATAT TAATGGCTAT 
ACTGGAGGCT ACAATGTTAC AGCTGCATTT GTCACTGGAC ATGTTGCTGG CTCCCATGCC
GCAGAAATTG CAGAATACAC CTATTTACCA ATTGAAGAAG TC 

EF084-4 (SEQ ID NO:320) 



NRRVGKKLLM TGGGRCNVTN NRPAEEIISF IPGNGKFLYS AFSQFDNYDI MNFFESNGIH 
LKEEDHGRMF PVTDKSKSIV DALFNRINEL GVTVFTKTQV TKLLRKDDQI IGVETELEKI 
YAPCVVLTTG GRTYPSTGAT GDGYKLAKKM GHTISPLYPT ESPIISEEPF ILDKTLQGLS
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCSS FINQELTRNG NQPVTVALDV
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFVTGGGIS LKEVTPKTME
SKLVNGLFFA GELLDINGYT GGYNVTAAFV TGHVAGSHAA EIAEYTYLPI EEV 

EF085-1 (SEQ ID NO:321) 

TAACCCATGA AATCATTTTG TCCCGCATAT GGGGATATGA CTTTGACGGT GATGGCAGCA 
CAGTCCACAC TCATATCAAA AATCTGCGGG CGAACTGCCG GAAAATATCA TCAAAACCAT 
CCGCGGTGTA GGTTACCGAT TGGAGGAATC ATTATAATGG AAAGAAAAGG GATTTTCATT 
AAGGTTTTTT CCTATACGAT CATTGTCCTG TTACTGCTTG TCGGTGTAAC GGCAACACTG 
TTTGCACAGC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC 
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA 
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC 
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT 
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG 
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC 
CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC 
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA
AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT 
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 
ATGCTTGAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA 
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT 
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA 
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC 
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG 
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT 
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG 
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT 
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA 
AATATTTAA 
EF085-2 (SEQ ID NO:322)

VFSYTIIVLL LLVGVTATLF AQQFVSYFRA MEAQQTVKSY QPLVELIQNS DRLDMQEVAG
LFHYNNQSFE FYIEDKEGSV LYATPNADTS NSVRPDFLYV VHRDDNISIV AQSKAGVGLL
YQGLTIRGIV MIAIMVVFSL LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL
GALAHDMHSM YIRLKETIAR LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLLEGM
LENIGDYKDH SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP
DFQTLAEANN QRFVTDIPAG QIVLSDPKLI QKALSNVILN AVQNTPQGGE VRIWSEPGAE
KYRLSVLNMG VHIDDTALSK LFIPFYRIDQ ARSSKKWAKR FGACHRTKNA GCHEPPICAG
KHLRWRFVLA GFTAHINTIN I
EF085-3 (SEQ ID NO:323) 

GC AATTTGTGTC TTATTTCAGA GCGATGGAAG CACAGCAAAC AGTAAAATCC
TATCAGCCAT TGGTGGAACT GATTCAGAAT AGCGATAGGC TTGATATGCA AGAGGTGGCA
GGGCTGTTTC ACTACAATAA CCAATCCTTT GAGTTTTATA TTGAAGATAA AGAGGGAAGC
GTACTCTATG CCACACCGAA TGCCGATACA TCAAATAGTG TTAGGCCCGA CTTTCTTTAT
GTGGTACATA GAGATGATAA TATTTCGATT GTTGCTCAAA GCAAGGCAGG TGTGGGATTG
CTTTATCAAG GGCTGACAAT TCGGGGAATT GTTATGATTG CGATAATGGT TGTATTCAGC
CTTTTATGCG CGTATATCTT TGCGCGGCAA ATGACAACGC CGATCAAAGC CTTAGCGGAC
AGTGCGAATA AAATGGCAAA CCTGAAAGAA GTACCGCCGC CGCTGGAGCG AAAGGATGAG
CTTGGCGCAC TGGCTCACGA CATGCATTCC ATGTATATCA GGCTGAAAGA AACCATCGCA
AGGCTGGAGG ATGAAATCGC AAGGGAACAT GAGTTGGAGG AAACACAGCG ATATTTCTTT
GCGGCAGCCT CTCATGAGTT AAAAACGCCC ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA
ATGCTTGAAA ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCTGCGCGA ATGCATCAAA
ATGATGGACA GGCAGGGCAA AACCATTTCC GAAATACTGG AGCTTGTCAG CCTGAACGAT
GGGAGAATCG TACCCATAGC CGAACCGCTG GACATAGGGC GCACGGTTGC CGAGCTGCTA
CCCGATTTTC AAACCTTGGC AGAGGCAAAC AACCAGCGGT TCGTCACAGA TATTCCAGCC
GGACAAATTG TCCTGTCCGA TCCGAAGCTG ATCCAAAAGG CGCTATCCAA TGTCATATTG
AATGCGGTTC AGAACACGCC CCAGGGAGGT GAGGTACGGA TATGGAGTGA GCCTGGGGCT
GAAAAATACC GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGATGATAC TGCACTTTCA
AAGCTGTTCA TCCCATTCTA TCGCATTGAT CAGGCGCGAA GCAGCAAAAA GTGGGCGAAG
CGGTTTGGGG CTTGCCATCG TACAAAAAAC GCTGGATGCC ATGAGCCTCC AATATGCGCT
GGAAAACACC TCAGATGGCG TTTTGTTCTG GCTGGATTTA CCGCCCACAT CAACACTATA
AATATTT

EF085-4 (SEQ ID NO:324) 

YQGLTIRGIV MIAIMVVFSL LCAYIFARQM TTPIKALADS ANKMANLKEV PPPLERKDEL
GALAHDMHSM YIRLKETIAR LEDEIAREHE LEETQRYFFA AASHELKTPI AAVSVLLEGM
LENIGDYKDH SKYLRECIKM MDRQGKTISE ILELVSLNDG RIVPIAEPLD IGRTVAELLP
DFQTLAEANN QRFVTDIPAG QIVLSDPKLI QKALSNVILN AVQNTPQGGE VRIWSEPGAE
KYRLSVLNMG VHIDDTALSK LFIPFYRIDQ ARSSKKWAKR FGACHRTKNA GCHEPPICAG
KHLRWRFVLA GFTAHINTIN I

EF086-1 (SEQ ID NO:325) 

TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA 
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC TCCGCTGATG
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACTTA
ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA TCAACGCCTA
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF086-2 (SEQ ID NO:326) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID 
LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY
TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEVVAA
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC
LLVFRKSKSK K 

EF086-3 (SEQ ID NO:327) 

ACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT
AAT

EF086-4 (SEQ ID NO: 328) 



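PENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID
LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN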



EF087-1 (SEQ ID NO:329) 



TAACTGGTGG ««» «™C« GGAGGGGTAA ^ ££££ 
CATGATGACC TATTCAATAC AGATCGAGAA "AT^AATA ^A^ AGCTAACTTA 
rrp2X ™ CAA aagATCCAAA CATACAATGG CCAATT1A1C ^xi^«~ ^^ppp™ 
« ShCKCOT TTTAOGTACT GGACTTTTGT TAGAAGATAA 
GTACAAGTAC AAGAAGCTGT TGCGTGCGTT TTAAAAAGTG TTTGGTC™ ^ 
TATCCTGATG GTTCCTTGAT TCAACATGGT ^Aliiiv- A CTGGGAG ATG 

GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA ^ 

AATGACCCTA ACATTAGTAA TTTA^TAAT ^SSS GcSgIaACG 

GTAAATGGAA AAATGCCATC GATGGTTTCT ^AGAAGTA TTTCCAGA^ 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AA 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AA 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATCTT^A * 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT ^AATCTTTA 
SSESS GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT JAATTTGGGA ATACGGAAAA 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ^TTOGCTCA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG ^ 
AATGGACAGG TTGCCTCTAT AGGAATGTTT ^ATAAAA JTAATGAAGG ™ 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA ™TT^ ^ 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATC^GATGAT JCM^ 

SSS5S 5SS 5S55S SEE ^ = aa 
=22£ SSSSSE SSS g -™ 

= SSS2 = = GTATGATCCA 
GTCATGATCA ATAGATG3AA TAATGACGAA GAAMTCCM SSaIcOT TCCGAATCCT 
£SEE = ™= SSS5 g«-Sga AG T AG TC GCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT
SgSgctcgc g^caatcat TCTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
aaa^aaSc aggaa^Laa agaacaccaa gaaaaagact acaccgcaag cagctggaaa 
g^Scagcg aagcattgaa acaagcacaa actgtggcag atcaaacaac agcaacgcaa 
gcagaagtag a^caagcaga aacagagtta cgttcggcag tgaagcaatt ggtaaaagtg 
ccaactaaag aagtagataa aaccaacttg ttgaaaatca tcaaagaaaa cgagaaacac 
caaSIag aISacaccgc aagcagttgg aaagtctaca gtcaagcatt gaagcaagcg 

PAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 

c^g^cgg cagt^cg attaacattg aaaaatagtg gggaaaataa aaaggagcaa 

AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 

aaacaagtta agcWaag ccaaggtggt ttcagaaaag ctagccaatt tttaccgagc 
a^SagIaa agaaa?Sat cgcgcttgtg attattggtc ttctagttat cgccag^gg 

TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 
EF087-2 (SEQ ID NO:330) 

TVPTANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT 

disi^gtc Slednqrlv qvqeavpsvl ksvssgdgly pdgsliqhgy FPYNGSYGNE 
SkgSr?q? SgsSemn Spnisnlfnv vdkgylqlmv ngkmpsmvsg Rsisrapetn 
pfSSesgk etianltlia kfapenlrnd iytsiqtwlq qsgsyyhffk kprdfealid 
l™Ssp aqatpmqsln vygsmdrvlq knneyavgis mysqrvgnye fgntenkkgw 
SSlyly nSfIqfdeg ywatidpyrl pgtivdtrel angaytckrs pQswvggsnn 
goSasiSmfI dksnegmnlv akkswflldg qiinlgsgit gttdasieti ldnrmihpqe 
Slnogsdkd nswislsaax plnnigyvfp nsmntldvqi eersgrygdi neyfvndkty 
™tfSiskn otengty eyliwgktn eeiaalsknk gytvlentan lqaieagnyv 
SSSdqe ™aydpm svisekidng vyrltlanpl qnnasvsief dkgilewaa 

^^SoN? ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV 

SSlkqaS ™qSa evdqaetelr savkqlvkvp tkevdktnll kiikenekhq 

£££££ — TVADQTTATQ AEVDQAEAKL —TUC QK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC 
LLVFRKSKSK K 

EF087-3 (SEQ ID NO:331) 



A ATCGGATGAT TCATCCACAG
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT
TTACAAAATA ATGCATCC

EF087-4 (SEQ ID NO:332)

NRMIHPQE
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY
TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNAS



EF088-1 (SEQ ID NO:333) 
TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGCGCTAA CAGATACATT GATTTTATTA
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTTACTGC TCCGCTGATG 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGCAACAGG AGCTAACTTA 
ACAGATATTT CAATCACCGT TTTAGGTACT GGACTTTTGT TAGAAGATAA TCAACGCCTA 
GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTTCCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT ACAATTGATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAGC GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
GCAAAATTTG CACCAGAAAA TTTAAGAAAT GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGATTTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CCTGCCCAAG CGACACCAAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATACGGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGACAGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGGTAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTAGATAAAA GTAATGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTCAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATCCTCGATA ATCGGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTAGTTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TGAATACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGACTGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCGCAGCTCT TTCTAAAAAC 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GACTGTATGC GTATGATCCA
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCTTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG GCATTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATAGTGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF088-2 (SEQ ID NO:334) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK LNKFTAPLML YAKDPNIQWP IYRATGANLT
DISITVLGTG LLLEDNQRLV QVQEAVPSVL KSVSSGDGLY PDGSLIQHGY FPYNGSYGNE
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYLQLMV NGKMPSMVSG RSISRAPETN
PFTTEFESGK ETIANLTLIA KFAPENLRND IYTSIQTWLQ QSGSYYHFFK KPRDFEALID
LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS MYSQRVGNYE FGNTENKKGW
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTVDTREL ANGAYTGKRS PQSWVGGSNN 
GQVASIGMFL DKSNEGMNLV AKKSWFLLDG QIINLGSGIT GTTDASIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI NEYFVNDKTY 
TNTFAKISKN YGKTVENGTY EYLTVVGKTN EEIAALSKNK GYTVLENTAN LQAIEAGNYV
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASVSIEF DKGILEVVAA
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIQEQKEHQE KDYTASSWKV
YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKKSIALVI IGLLVIASGC
LLVFRKSKSK K



EF088-3 (SEQ ID NO:335) 

A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 

AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT ACACCGCAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACCAAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GGGAAAATAA AAAGGAGCAA 
AAAAATGGGG GGAATAATGG ACACTTAAAT ACTAGTACAG GAGTTGATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCCAATT TTTACCGAGC
ACAGGAGAAA AGAAA 

EF088-4 (SEQ ID NO:336) 

T PEVTKEALEK LIQEQKEHQE KDYTASSWKV 

YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP TKEVDKTNLL KIIKENEKHQ 
EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK NSGENKKEQK 
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST GEKK 



EF089-1 (SEQ ID NO:337) 

TGACAGATAC ACCTGCTAAC ACAGGAAACT AAGAACGACA GCATACACGC AAGATCGGGA
TATAGGTCAA AAATTTTTTG GCTTATCTTT CGGTCTTTTG GTGCTTATAA TACAACAAAG
AATGACAGAC ATAGGAGAAT GAATATGAAC AGATGGAAAG TATATGCAAC GGTAATCGCT
TGTATGTTAT TTGGCTGGAT TGGCGTGGAG GCGCACGCTT CTGAATTTAA TTTTGCGGTC
ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA
ATTTCTTTAA AAGGGGAACG ATTAACGCCA GGAAAATATG TCTTGAAATC AACGGCCTAT
GGTGTAAAAG ATGAAAAGGG CACCTATCAA GTCAAAGGCG CCAATGGTGA AGAACGGTAC
CTGTACAAAT GGGAATTTAC AAAAGAATTT ACTATTTCTG GGGACGTCGC TAAAGAATTA
AATGAAAAAG ACGTAACCAT TAAAGGAACC AATTGGTGGT TGTATCTACT GATTGCATTA
ATCATTCTAG CGCTGCTCTT ATTGATTTTC TTCTTGTATC GTAAAAAGAA AAAAGAGGAA
GAACAACAAT CTGAGCAATA A



EF089-2 (SEQ ID NO:338)

MNR WKVYATVIAC
MLFGWIGVEA HASEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT
IENTVNSATT NLNGVVEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP
KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 
VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 
SLKGERLTPG KYVLKSTAYG VKDEKGTYQV KGANGEERYL YKWEFTKEFT ISGDVAKELN 
EKDVTIKGTN WWLYLLIALI ILALLLLIFF LYRKKKKEEE QQSEQ 

EF089-3 (SEQ ID NO:339) 



T CTGAATTTAA TTTTGCGGTC 

ACACCAACAA TTCCCGAAAA TCAAGTGGAT AAATCAAAAA CCTACTTTGA CTTAAAAATG 
GCGCCTGGTG CCAAACAAAC CGTAGAAATT CAGTTACGCA ATGATACAGA TGAAGACATT 
ACCATTGAAA ATACGGTGAA CTCAGCGACA ACAAATTTAA ATGGCGTAGT AGAATATGGC
CAAAACGGGA TCAAACCTGA CAAAACCTTA CGTTTTAACT TAAAAGATTA TGTGGAAGCA
CCGAAAGAAA TCATCTTGCC GAAGCATTCC CAAAAGACCT TACCTTTAAC CATTACGATG
CCTAAAGATT CTTTTGATGG CGTGATGGCT GGCGGTATAA CACTCAAAGA GAAAAAGAAA
GAAACAACGA CTTCTGCGGA TCAATCAAAA GGGTTAGCTA TTAATAATGA ATACTCCTAT
GTTGTGGCTA TTATTCTTCA GCAAAATGAG ACAAAGGTTC AACCAGATTT AAAATTACTG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA AACGTCATCA ATGTTTCTTT ACAAAACCCA 
CAAGCGGCCT ATTTAAACCA ATTACATTTA ATCAACACTG TTTCAAAAGG AGGCGAAACG 
CTTTACCAAT CCGATACTGA GGATATGCAA GTGGCGCCAA ACTCTAACTT TAGTTACCCA 
ATTTCTTTAA AAGGGGAACG AT 

EF089-4 (SEQ ID NO:340) 

SEFNFAVT PTIPENQVDK SKTYFDLKMA PGAKQTVEIQ LRNDTDEDIT 

IENTVNSATT NLNGVVEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KTLPLTITMP

KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET KVQPDLKLLG 

VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV APNSNFSYPI 

SLKGER 

EF090-1 (SEQ ID NO:341) 

TAGTCTCTAA GAAATAAACC TAAAATTATT GATATAAAGG ATGAACAAAT GAAAAAAGAA 
GAAATGCAAA TGCGTAATAC ACGTCGTCAA AAATCAGGAA AAAATAATAA AAAGAAAGTA 
ATTATTACTT CTTTGGTTGG ACTAGCTCTG GTTGCTGGGG GCAGTTATGT TTATTTTCAA 
AGTCACTTTT TNCCAACCAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 
GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT 
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA 
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT 
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC 
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA 
CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT ATAA 



EF090-2 (SEQ ID NO:342) 

MRNTRRQK SGKNNKKKVI ITSLVGLALV AGGSYVYFQS 

HFXPTTKVNG VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT KEEKIQLPKK YQLDQKFLKD
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTVVDTQ
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
DAPGEFDKPV DYGEEV 

EF090-3 (SEQ ID NO:343) 

CAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGTT AAATGTAAAT 

GCTGCAGAAG AAAAATTAGC GCAAGTTAAT CAAACCGAAG AAGTTGTGGT TCAAACGGGG 
ACAAAAGAAG AAAAAATTCA ACTTCCTAAA AAATACCAAT TGGATCAAAA ATTTTTAAAA 
GACCATTTAC ACAGTAGCAA GGTGAAGCTA CCGTTAAACG AGGCATTCAA AAAAGAACTA 
GAAGCCAAAT TAGCAACTTT GAGTTTTCCA GAGGGGAAAC CAAGCAAAAA TGCGAGTATC 
CGTCGAGGCA ATGGCACTTT TGAAATTGTT CCCGAAGAAC AAGGCACAGT AGTGGACACA 
CAGCGCTTAA ACCAGCAGAT TATTGCGGAT GTTGAAGCGG GAAAAGGCAA CTATCAATAT 
AATGCCAAAG ATTTTTATAA AGCCCCTGAA ATTACAAAAG AGGATCAAAC GTTAAAGGCA 
ACATTGACAA CGCTCAATAA CAAGTTAAAT AAAACAATTA CAGTTGATAT TAATGGTGAA 
AAAGTAGCCT TTGATAAAAC ACAAATTCAA AACGTGCTGA ATGATGATGG CACAATCAAC 
AAAGAAAAAC TAACTACTTG GGTGACACAA TTAGAAACAA CATATGGTTC TGCTAATCAA 
CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTCGTT TTAAAAACAA CGGAAGTTAT
GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTACTAG TAAACGCGCT GAATAGCCAA
GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTGGTGATA CCAAAGAAAA TAGTAAAATT
GCCAATAATT ACATTGAAAT TGATTTAAAA GATCAAAAAA TGTATTGTTT CATTGATGGC
AAAAAAATAG TCACCACAGA TGTCATTACT GGCAGATATA ACAAAGGAAC CGCAACAGTA
CCAGGATTCC ATACAATTTT ATATCGGACA ACCGATGTGA ATTTAGAAGG TCAAATGCTT 
GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGGATGC CGTTATTAAG TCAAGGGGGC 
GTTGTCACAC AAATCGGGAT TCATGACTCC GACCATAAAT TGGATAAGTA TGGCGATAAA 
GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCTGTATCA ATACGCCAGG AACAGAAGTT 
TCAAAAATCT TTGATGTATC CTATGACGGA ATGCCGGTAA TTATTTATGG ACATATCTAT 
GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT AT 

EF090-4 (SEQ ID NO:344) 

TKVNG VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT KEEKIQLPKK YQLDQKFLKD
HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTVVDTQ
RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNKLNK TITVDINGEK 
VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGSANQP VLFTDVHGTT RRFKNNGSYG 
WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK 
KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQMLD GSRYSVPVKY WMPLLSQGGV 
VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 
DAPGEFDKPV DYGEEV
EF091-1 (SEQ ID NO:345) 

TAATTGGNGG AGATTTTTAT GGCTAAAAAA GGCGGATTTT TCTTAGGNGC AGTAATTGGT 
GGAACAGCAG CAGCCGTTGC CGCATTATTA CTTGCACCAA AATCAGGTAA AGAATTACGT 
GATGATTTAT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA 
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 

EF091-2 (SEQ ID NO:346) 

MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD DLSNQTDDLK NKAQDYTDYA 
VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 
AEAKEDVKDA AKDVKKEFKG 

EF091-3 (SEQ ID NO:347) 

AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 
GCTGTTCAAA AAGGAACAGA ATTAACAGAA ATCGCAAAAC AAAAAGCCGG CGTTTTATCA
GATCAAGCCT CTGATTTGGC AGGTTCTGTC AAAGAAAAAA CAAAAGATTC ATTGGATAAA 
GCACAAGGTG TTTCTGGCGA CATGCTTGAT AACTTTAAAA AACAAACAGG TGATTTATCT 
GATCAATTTA AAAAAGCAGC TGACGATGCT CAAGATCACG CAGAAGATTT AGGTGAAATT 
GCCGAAGATG CAGCAGAAGA TATCTATATT GACGTTAAAG ATTCTGCGGC AGCGGCCAAA 
GAAACTGTTT CTGCTGGTGT CGATGAAGCA ANAGAAACCA CCAAAGATGT TCCTGAAAAA 
GCTGCAGAAG CAAAAGAAGA TGTTAAAGAT GCAGCGAAAG ACGTAAAAAA AGAATTTAAA 
GGGTAA 

EF091-4 (SEQ ID NO:348) 
SNQTDDLK NKAQDYTDYA 

VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETTKDVPEKA 
AEAKEDVKDA AKDVKKEFKG 

EF092-1 (SEQ ID NO:349) 

TAAGGGGATG AAGAAAAAAT GGCAAAAAAA ACAATTATGT TAGTTTGTTC CGCAGGAATG 
AGCACGAGTT TATTAGTAAC AAAAATGCAA AAAGCAGCAG AAGATCGTGG CATGGAAGCA 
GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGATAA 

EF092-2 (SEQ ID NO:350) 

MAKKT IMLVCSAGMS TSLLVTKMQK AAEDRGMEAD IFAVSASEAD TNLENKEVNV 
LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG
EF092-3 (SEQ ID NO:351) 
AG AAGATCGTGG CATGGAAGCA 

GACATCTTTG CAGTATCGGC TTCTGAAGCA GATACAAACT TGGAAAATAA AGAGGTGAAT 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC ATGAAAGGGC AATTTGAACA AAAATTACAA 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC ATGGCAGATT ATGGCATGAT GAATGGCGAA 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GGAT 

EF092-4 (SEQ ID NO:352) 

EDRGMEAD IFAVSASEAD TNLENKEVNV 

LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGMMNGEK VLDQAISLMG 

EF093-1 (SEQ ID NO:353) 

TAGTTTTTTT CCGATAAAGG GAGAATTTTA ATGAGGCAAA AATATTCAGG AAACTTATTG 
TTCACGGCCA TGGCCATTGT TTATTTGATG AGTTTTCTCG CCCTTCAGTT ACTAGAAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGGA ATAG 

EF093-2 (SEQ ID NO:354) 

M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETILE 

EF093-3 (SEQ ID NO:355) 
CCTTCAGTT ACTAGAAGAA 

CGTCAGTTAA CACAAAAATT TACGCAAGCT ACCCAGGAAT ACTATGCAGG GAAAAGTATC 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAGACGAA AGTTAAAAAC AGAAGAAAGG 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACATACAAAA ATGAACAATT AAGAATAACT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAATATCAAG AGAGAGTTTC TCATCAAAAA 
AAAGCGGAAA CAATACTGG 



EF093-4 (SEQ ID NO:356) 

LQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK 
AETI- 

EF094-1 (SEQ ID NO:357) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TGTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF094-2 (SEQ ID NO:358) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKVVQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT AGNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YVVAQAIDVE ATKAAQEKDE KAKPVVIAET 
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF094-3 (SEQ ID NO:359) 

CGA TGAAATTACT 

CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT 
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTACTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA 
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAG 

EF094-4 (SEQ ID NO:360) 

DEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKVVQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP 
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDET 

EF095-1 (SEQ ID NO:361) 

TAAGAATTGT TGGATTGTTC TTTAGAAAGA AGGGACAATA TGAAGCGAAG TAAATGGAAA 
GAATTGATAG TAACGGGCAT CTGCCATATA TTAGTATTCC CCATACTAAT ACAGACAACT 
GTTTTTGCAG AAACATTACC AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 
ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC ATAA 



EF095-2 (SEQ ID NO:362) 

MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VREGTNHSLT 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 

QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEVVIPSEK VAVYSDMSTV 
LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTVVY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF095-3 (SEQ ID NO:363) 

AAGTACAAAA CAAGTAAGAG AAGGAACCAA TCATTCATTA 

ACAGCAGAAA AAGCCGAAAG TGAACAACCA CAGACAAAGG ATAAACTACA TGATGAAGAA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC GATAATGAGG CTAATGTTAC AAGTCAAACG 
ATTAGAGAAA GAATTGAGAC GCCTAACCTA ACTTATCGTT ATGGATTTAT TAATGAAGAG 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT CTACAGTATC ATAGTTGGCA AGGCAATTCC 
CCAGATGGCA TAAATGTGTG GGAAGGTGAA AGTCAACCAG TGACAGCATC TACAGTGGCT 
AATTTAAAAG AAGTGGTAAT TCCAAGTGAG AAAGTAGCCG TCTATTCCGA CATGTCAACG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT TTACCAAGAT ATTATACTTC TTTAAGCTTA 
TACAATAAGA AAGGGGAAAT TGATCCCAAT TATCCGCTGC CAACTATTTC CGACGCATCA 
GGAAACCAAT ATCCAACAAC AATTTCGCAA TTTGAATTGG AAAAAATGTC TGCACAACAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC ATTAGCGAGA GTCAAAAACT AATCGTTCCT 
TTGTACAACC AAGTGAAGGT TGATTCATCG AATCAATCTG GGCTATTGAA TTACTTTAAA 
TTTTCAGGGC CGGTTTATTA TCATGTTACC AATCGCAAAG TGACAGAACA TTTTGTGGAT 
ACTCAAGGGA AACCAATCCC TCCACCACCG GGGTTTAGAC AAGGAAAGCA AACACTTATT 
GAGCGTGACC CTTACACCTT TAAACAGAAA GATCTTTTGC CAAGTAGCTA TGAAATTGAC 
TCAAAAACGT ATCAATTTCA AGGATGGTAT AAAGGGAAAA CGAAACCTGA AAATTTAGAA 
AAAAGCGTAA CGCCCAGTTA TGATATTACC TATGACGACA ATGATGATTT AACTGTTGTC 
TATAAGGAGA TACCTCAAAA AAATTATACA TTTGAGGATG TCAATGGTGT TGAAATTGCA 
CCACCATCTG ATTTTATTCA GGATCACCAA CAACCAATAA CTACGGATGG CTTTCGCTAT 
TTAGCTGGAA AAAAACTGCC ACAACAATAC AGCGTTAACG GTAAAACTTA TTTATATCAA 
GGTTGGTATC AAGATAAAAC NAAACAAGAG AGCTTAGAAA AAACGAAGCG ACCCATAAAC 
TCCCCTGTTT TTAATGAAAT GAACGCTATT ACAGCAGTGT ATAAGGAAAT AACTGCAAAA 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA GTCATGCCAA GTGGTTATAT ACAAATTTGG 
CAGATTATGC TTACAAATGT GGGAGAAGTA CCGTTAAAAA AAATAAACTT AAAGCCAGCA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA ATCCAAGTCA CGATTCGTGT TGGATCTGAA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA AATTGGCGAG TTGGCATTAC TTTAAATACG 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT ATGATGACAA CAATTGCTAC AGGTGAACCA 
GATCAAGTGT TACAAGCGGC TGTTGAAATG AATGGAAATT TTTCTGCTGT TCACGCAGCT 
GATACTGTCA GAATCCAACC TAAAAATCAA GAAATTGTGG CACCAGATGA GGAAGGTTTT 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA GTCGCCATTT CTAGCAACAC GCAGCAACAT 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA AATGGTCAGG AAAATCCATA TTTACGTTTG 
AAAAAATCAC AACCCAATTG GGCACTAACT GCAGAACTAT CCCCCTTTGA AGGAAGAGTG 
GATCAACTAT CATCAATGAC AAAGTTATTG TTAGGAACAA CCAATGTTTC AGGTTTTATT 
CAGTACAATC AACCAACGGA AACTAAAGTT GCTCTTGGCA AAACAACCGC TATTCAATTA 
GTTGCCAACG GTGTAGCTAG CCATATTGTT GCCAATGGTC AGTTTGACGA AAGTGATGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC AAATTAGAAA TTCCAGCAAA TCAAGGTAGA 
AAAGATCAAA CTTATCAAGC AATGGTGACT TGGAATTTAG TGACAGGCCC A 

EF095-4 (SEQ ID NO:364) 



STKQ VREGTNHSLT 
TABLE 1. Nucleotide and Amino Acid Sequences of E. faecalis Genes. 

AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFINEEG 
QPVNANEILL QYHSWQGNSP DGINVWEGES QPVTASTVAN LKEVVIPSEK VAVYSDMSTV 
LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLPTISDASG NQYPTTISQF ELEKMSAQQY 
SQKTGVTFNI SESQKLIVPL YNQVKVDSSN QSGLLNYFKF SGPVYYHVTN RKVTEHFVDT 
QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LLPSSYEIDS KTYQFQGWYK GKTKPENLEK 
SVTPSYDITY DDNDDLTVVY KEIPQKNYTF EDVNGVEIAP PSDFIQDHQQ PITTDGFRYL 
AGKKLPQQYS VNGKTYLYQG WYQDKTKQES LEKTKRPINS PVFNEMNAIT AVYKEITAKA 
EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKINLKPAS GWSPGLARPI QVTIRVGSEP 
NKIVPITDEN WRVGITLNTE VPIGQTATIM MTTIATGEPD QVLQAAVEMN GNFSAVHAAD 
TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AISSNTQQHG LKQAADYYEN GQENPYLRLK 
KSQPNWALTA ELSPFEGRVD QLSSMTKLLL GTTNVSGFIQ YNQPTETKVA LGKTTAIQLV 
ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIPANQGRK DQTYQAMVTW NLVTGP 

EF096-1 (SEQ ID NO:365) 

TGAGGTGGCC AAGTTAAAAT GAAAAAATTA CAGTCACTTT TTATTGGAAT TATCGCTATT 
ATTGTCATCT TGTTTTTTGG CGTGCGCCAA TTGGAGAAAG CAAGTGGCAT GGCAGGAGCA 
GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATGATA AATTTATTGA CGGCCGTCAG 
ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 
GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 
GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 
CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAAT AA 

EF096-2 (SEQ ID NO:366) 

MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGDY IDPALIKKFE 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 
RK 

EF096-3 (SEQ ID NO:367) 
AAGTGGCAT GGCAGGAGCA 

GATACCTTGA CCATTTACAA TTGGGGGGAC TATATAGATC CGGCCTTGAT TAAGAAATTT 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC GAAACCTTTG ATTCTAATGA AGCTATGTAT 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT GATATTGCCA TTCCTTCTGA ATATATGATT 
CAAAAAATGA TGAAAGCGAA GATGCTTTTA CCACTTGATC ACAGCAAATT AAAAGGCTTA 
GAAAACATTG ATGCACGCTT TTTAGATCAA TCCTTTGATC CCAAAAATAA GTTTTCCGTT 
CCGTACTTCT GGGGCACGTT GGGGATTATT TATAATGATA AATTTATTGA CGGCCGTCAG 
ATCCAACATT GGGATGATTT ATGGCGCCCG GAATTAAAAA ATAATGTCAT GCTGATTGAT 
GGCGCTCGCG AAGTGTTAGG ATTATCTTTG AACAGTTTAG GCTATTCGTT AAACAGTAAA 
AACGACCAAC AATTACGTCA GGCTACCGAT AAGTTAAACC GATTAACGAA CAATGTCAAA 
GCAATTGTTG CCGATGAAAT CAAAATGTAC ATGGCTAATG AAGAAAGTGC AGTTGCTGTA 
ACTTTCTCTG GTGAAGCTGC TGAAATGCTA GAAAACAATG AACATCTACA TTATGTGATT 
CCCAGTGAAG GCTCTAATCT CTGGTTTGAT AACATTGTGA TGCCTAAGAC AGCCAAAAAT 
AAAGAGGGTG CCTATGCATT TATGAACTTT ATGTTACGAC CAGAAAATGC GGCACAAAAT 
GCAGAATATA TTGGTTATTC CACACCAAAT AAAGAAGCTA AAAAACTATT ACCAAAAGAA 
GTTGCCGAAG ATAAACAATT TTATCCAGAT GATGAAACTA TCAAACATTT AGAAGTTTAC 
CAAGACTTAG GTCAAGAATA CTTAGGAATT TATAACGATC TGTTCTTGGA GTTTAAGATG 
TATCGGAAA 

EF096-4 (SEQ ID NO:368) 
SGMAGAD TLTIYNWGDY IDPALIKKFE 

KETGYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE 
NIDARFLDQS FDPKNKFSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT 
FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPKTAKNK EGAYAFMNFM LRPENAAQNA 
EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY 
RK 

EF097-1 (SEQ ID NO:369) 

TAGAAGTATT CTAATTATCT ACATAGAGAG CGAGGGACAA GGAATATGAA GGAAAAAGAA 
ATGCATTCGC TCTTTTTTAA ACATAAGTTT GTGAAAGTAA CTCCCTATTT ACGTCGTTTT 
GGTCATCGTT TGAGTGGGAT GATTATGCCA AATTTGAGTA TTTTTATTGC GTGGAGCTTA 
TTGTCTTTGG TGGCTGGCTA TACGACTGGG AATCTACGGC TAGCTCTTTC TGAAGTCGAA 
ACGATAATGA TTCGAGTTGT TTTACCGATT CTAATTGGTT TTACAGGCGG AAAAATGTTC 
GAGGAACAAC GTGGCGGCGT TGTTGCTGCT ATTGCGACAG TGGGCGTGAT TGTTTCCACA 
GATGTTCCAC AGTTGTTTGG TGCTATGTTT ATTGGCCCTT TAGCAGGATA TACTTTCGCC 
AAAATTGAAC AAATTCTCTT ACCGAAAGTT AAAGAAGGCT ACGAGATGCT GACTAAAAAC 
TTTTTAGCAG GAATTGTGGG AGGACTGCTG TGCTGTTTTG GTATTCTGGT TGTAGCTCCG 
GCTGTTGAAA GCGCTAGTTT TTGGCTGTAT CAATTTTCTT CTTGGTTAAT TGAAGCCAAT 
CTTTTACCAT TGGTTCACGT TTTCTTAGAG CCCTTAAAAG TGTTATTTTT TAATAATGCG 
ATTAACCATG GCTTATTAAC GCCTCTAGGT TTAGAAGGTG CTAGTCAAAC AGGTCAGTCC 
ATTTTATTTC TATTGGAAAC AAACCCTGGA CCAGGCGTGG GCGTTTTGGT TGCTTTTCTG 
CTGTTTGGGC CTGTAGGACA ACGAAAAACA GCAGGAGGTG CCACCATGAT TCAACTGATT 
GGGGGCATTC ATGAAATTTA TTTTCCGTTT GTTTTGATGG ACCCGCGCTT ATTTTTAGCA 
GTAATTGCTG GAGGAATGAG TGGTACGCTT GTTTTTCAAA TATTTAATGT GGGTCTAAGT 
GCTCCAGCTT CGCCAGGTTC ATTGGTTGCG ATTTTAGCCA ATGCCCCGAC TGATGCGAGG 
CTGGCGGTTT TTAGCGGAAT TTTTGTTAGC TTTCTGTGCT CTTTTGCAAT AGCAAGCTTG 
TTATTAAAAC GTCAACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 
CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 
GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 
ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 
ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 
GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 
TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 
ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 
GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 
CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 
GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 
AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTTAA 

EF097-2 (SEQ ID NO:370) 

MLTKNF LAGIVGGLLC CFGILVVAPA 

VESASFWLYQ FSSWLIEANL LPLVHVFLEP LKVLFFNNAI NHGLLTPLGL EGASQTGQSI 

LFLLETNPGP GVGVLVAFLL FGPVGQRKTA GGATMIQLIG GIHEIYFPFV LMDPRLFLAV 

IAGGMSGTLV FQIFNVGLSA PASPGSLVAI LANAPTDARL AVFSGIFVSF LCSFAIASLL 

LKRQRGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 

PVTYQSVHQM KWQPKTLVVI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 

SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 

ETVFFTKETT YVVTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA DRCF 

EF097-3 (SEQ ID NO:371) 

ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC 

CAAGTGGAAA CAGTCACACC TAACTATCAG CAAATTTTAT TTGTTTGTGA TGCAGGAATG 

GGCTCAAGTG CCATGGGGGC TAGTTTGCTA AGCCGACAAT TAAAAGCTGT GAACTTGGAG 

ATGCCTGTGA CTTACCAGTC CGTTCATCAG ATGAAGTGGC AGCCTAAGAC ATTAGTGGTC 

ATTCAAGCAG AATTGAAACA GTTAGCACAA AAGTACGTCC CAGAAAAGGA TATGGTGAGT 

GTTCAAAATT TTTTAGAAAT TAAATCCTAT TACCCGCAAG TTTTAGCCAA ACTGACTGCT 

TCTTCTCAAG AGCAATCTTC ACTTGGTTCA GAGTCTACTG AAACGAACTC GACAAAACAA 

ATACAGAAGC TTGTTTTTTT ATATGCCGAG AATGTTCGAG GATCGCAAAC AATGGGAATG 

GAATTATTGC GGCAACAAGC GGCGAAACAA GGAGTCGCGA TTGAAGTATC TAAAGAGCCA 

CTGGAAACAG TCTTTTTTAC CAAGGAGACA ACCTACGTAG TGACTCGTGA ACTGGCGCAA 

GCCTATCATT TAGATCTAAC GCAACAAAAT TTATACGTAG TTACTAGTTT TTTGAATAAG 

AAAGAGTATC AAGAATGGCT GGAAGGAGGA GCTGATAGAT GTTTTT 

EF097-4 (SEQ ID NO:372) 

RGIEPV SMIKMKEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM 
PVTYQSVHQM KWQPKTLVVI QAELKQLAQK YVPEKDMVSV QNFLEIKSYY PQVLAKLTAS 
SQEQSSLGSE STETNSTKQI QKLVFLYAEN VRGSQTMGME LLRQQAAKQG VAIEVSKEPL 
ETVFFTKETT YVVTRELAQA YHLDLTQQNL YVVTSFLNKK EYQEWLEGGA DRCF 

EF098-1 (SEQ ID NO:373) 

TAAATGAAAA AGACAAAAGT AATGACATTG ATGGCAACCA CAACTTTAGG CGCACTGGCA 
CTTGTACCAA TGAGTGCATT AGCAGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 
CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 
AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 
CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 
AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 
GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 
GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 
ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 
GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 
GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 
ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT AA 

EF098-2 (SEQ ID NO:374) 

MKKTKVMTLM ATTTLGALAL VPMSALAVDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 
PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRVVSAS QSAKPTTAPA 
TIELNPTGAE SVVMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF098-3 (SEQ ID NO:375) 

AGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGATT 

CAATTTGCAC CAAATACGAA CCCAACGAAT CCAGTTGATC CGACGAATCC AGACCCAGAT 

AAACCAATTA CACCAGTTGA TCCAACTGAT CCGACAGGGC CTAAGCCAGG GACAGCAGGT 

CCGTTATCCA TTGACTATGC ATCTAGCTTA TCTTTTGGGG AACAAACGAT TACCTCAAAA 

AATATGACCT ACTATGCAGA AACACAAAAA TACAAAGATA ACGCTGGTGC CGACCAAGAA 

GGCCCAAACT TTGTTCAAGT CTCAGATAAT CGTGGGACTG AGACAGGTTG GACGCTAAAA 

GTAAAACAAA ATGGTCAATT CAAAACTGAA GCCAACCAAG AACTAACAGC GGCCAAAGTA 

ACTTTAAGCA ACGGACGCGT GGTTTCAGCT TCACAATCTG CAAAGCCAAC GACAGCGCCA 

GCTACGATTG AATTAAACCC AACTGGGGCT GAATCAGTGG TCATGGCTGC TGGCGATAAA 

GAAGGTGCGG GTACGTACTT AATGAGCTGG GGCGATAGTG TAGATACCGC TAAAACAAGT 

ATTTCATTAG AAGTACCTGG TTCAACCACA AAATATGCGA AAAAATACAC GACAACTTTT 
ACTTGGACTT TGACAGATAC ACCTGCTAAC ACAGGAAACT 

EF098-4 (SEQ ID NO:376) 

VDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 

PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG 
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRVVSAS QSAKPTTAPA 
TIELNPTGAE SVVMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT 
WTLTDTPANT GN 

EF099-1 (SEQ ID NO:377) 

TGATGTTGTA GAGGGCTGAT GAAATGTTTA TCAGTCTTCT TTTTATTGAA AGGAGAGATC 
ATGAAGAAAT TAGGCAAGGT TTTAATTGTT AGTTGTTTTA TTTTTATTCT TCCTTTTTTA 
TTATTTTTAG GTGTATTTTC TTCTAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 
ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAATAG 

EF099-2 (SEQ ID NO:378) 

MKCLS VFFLLKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT 
PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGVVA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EF099-3 (SEQ ID NO:379) 

TAGTGAA AGCGGAGATT CTTCCCAGTT TCAGCCCGCT 

ACACCACAGG AAAAAGTAGC ATTAGAAGTT TCTAACTACG TGACGTCACA TGGCGGAACG 
TTGCAGTTTG CTTCCGCTTG GATTGGCAAT ATGGAACATG AAAGTGGATT AAATCCTGCT 
AGAATTCAAA GTGATTTATC GTTTAATTCA GCGATAGCTT TTAATCCTTC GTTAGGCGGT 
TATGGAATTG GGTTAGGACA ATGGGATTCA GGACGAAGAG TTAATTTATT AAATTTTGCA 
AAAAGTCAAA AAAAGGAATG GAAATCAGTA GCTTTACAAA TGGATTTTGC GTGGAATAAG 
GATGGTTCTG ATAGTGACTT ACTTAAAAGA ATGTCTAAAT CAAAAGATGT GAATACACTT 
GCGGTAGATA TTTTGAAGCT GTGGGAACGA GCTGGAACAA AAGATGATCC CGCAGAACAA 
GTAAAAAGAA AGGCTAGTGC TAATAATTGG TATAAACGAC TTTCTACAGG TTCCATGGGC 
GGAGGTTCAG CCAATGTTGG TGGAGGAAAA ATTGATGCCT TGGAAAAAGT GATGGGGCAA 
ACTATTAATG GTGGTCAATG TTATGGCTTA TCTGCTTTTT TTGTTGAAAA ACAAGGAGGT 
CTACAAATGA TGGGTACGGG GCATATGTTT GCGAGTGAAA TTGGTAATGA TTATCCTTGG 
AGTTCAATTG GTTGGACAGT CATAAAGAAT CCAAATTATT CAGATATTAA AGCAGGAGAT 
GTCATTAATT TTGGTCAAGG TGGTGTGGCT ACTAGTATTT ATGGGCATAC TGGTGTAGTG 
GCAAGTGTTG AAGGTAAAAA CAAGTTTACT ACTTATGAGC AAAACGCTGA ACAAGGTCAA 
ATTGTTGCTA AGTATTTTCG GACTTGGGGA TTAGATTTTC CACATGTGAC CAGCATAGTA 
AGGAAAT 

EF099-4 (SEQ ID NO:380) 
SES GDSSQFQPAT 

PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHESGLNPAR IQSDLSFNSA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA LQMDFAWNKD GSDSDLLKRM SKSKDVNTLA 
VDILKLWERA GTKDDPAEQV KRKASANNWY KRLSTGSMGG GSANVGGGKI DALEKVMGQT 
INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS SIGWTVIKNP NYSDIKAGDV 
INFGQGGVAT SIYGHTGVVA SVEGKNKFTT YEQNAEQGQI VAKYFRTWGL DFPHVTSIVR 
K 

EF100-1 (SEQ ID NO:381) 

TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EF100-2 (SEQ ID NO:382) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF100-3 (SEQ ID NO:383) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAAT 

EF100-4 (SEQ ID NO:384) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF100-1 (SEQ ID NO:385) 



TANTTATGGC AATATGGAAG GAGTTTTATA ATGAAAAAGA AACAAAAATA CGCAGGGTTT 
ACATTATTAG AAATGTTGAT TGTCTTATTG ATTATTTCCG TATTGATTTT ACTTTTTGTC 
CCTAACTTAG CGAAACATAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 
ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAATGA 

EF100-2 (SEQ ID NO:386) 

MKKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK 
IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF100-3 (SEQ ID NO:387) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 
CAAT 

EF100-4 (SEQ ID NO:388) 
KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q 

EF101-1 (SEQ ID NO:389) 



TGAGGAGATG AAACGAAGAA AATGAAGAAG AAAACGATAA TTATATTGGG GGCAGTTGCG 
GTAATTGCGG TTGGGGGCAT CGTAACTGTG AATGCGTTAA ATAAAAATGC ACAACAAGTA 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GACCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 
GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAATAG 

EF101-2 (SEQ ID NO:390) 

MKKK TIIILGAVAV IAVGGIVTVN ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ 
QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDVVDAGT ELFTYEDEAV TKEIEAQQNS 
LAKLETKRAN IYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA 
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL 
TSVSNNVVVD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ 
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD 
RVVISSKKPV KVGDIVESDA AIASDESATN ESMTDASK 

EF101-3 (SEQ ID NO:391) 

TAAAAATGC ACAACAAGTA 

GCTGTCAAGC AAGCGCCTAA AGATGACTGG GGAATTGACT ATTTTGACGT TCCCGACTTG 
CAACAAATTT ATATTAACGG TGTCATCCAA CCGGAACAAA TGGAAGCCTT TGCGCGTGAT 
CAAAAAATAA CAAAGGATCC AGAGATTAAG GTGAAAAACG GCGATGTCGT AGATGCAGGC 
ACAGAATTAT TTACTTATGA AGATGAGGCG GTCACAAAAG AAATTGAGGC ACAACAAAAT 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG AATATCTATA ATAAGTGGAA TCGGGCCATT 
GATAAATTTA ATAAAACTAA AGAAGAAGAC CGCACGATGT CTGGTGATGA TTTAAATGAA 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT GAAGAGATTA CCTTCACCAA TGAAACCTTA 
GCGGATTTAG GAGCGAAGCA ATATATTTCC ACAAAGGCTA ATTTCAAAGG TCGTGTATCA 
ATTCCAGAAG TAAAAGATGC CAATTCACCG ATTTTACGGT TAACTTCAGA AGATCTTTAT 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG ACTAAAATTA GTGTTGGGCA AAAAGCTAAA 
CTAACTTCTG TTTCCAACAA TGTGGTTGTG GATGGCTCAA TTTCTTACAT CGATGATAAT 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT GGCAATCCAG AGGGCGGCAC AACGATGTCT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT TTAGACAAAG TCAAAAATGG CTACCATATG 
CAAGCAACCA TTGATTTAGG CGATTTAGGG GCGATTGAGT TACCGAAAAA AGCGATTCAA 
AAAGAGGGTG AACAGGCCTA CGTTTTAGTG AATGATTTTG GAACCATCAT TCGTCGTGAT 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA ATGGCGATTG AATCTGGCTT AGAATCAGCC 
GAGCGAGTGG TTATTTCTTC AAAAAAACCA GTAAAAGTCG GTGATATTGT TGAATCAGAT 
GCAGCGATTG CTTCTGATGA ATCAGCAACC AACGAATCAA TGACAGATGC GTCGAAAT 

EF101-4 (SEQ ID NO:392) 

KNAQQVA VKQAPKDDWG IDYFDVPDLQ 

QIYINGVIQP EQMEAFARDQ KITKDPEIKV KNGDVVDAGT ELFTYEDEAV TKEIEAQQNS
LAKLETKRAN IYNKWNRAID KFNKTKEEDR TMSGDDLNEQ YQTEVDAVDE EITFTNETLA
DLGAKQYIST KANFKGRVSI PEVKDANSPI LRLTSEDLYL AGKVNEKDLT KISVGQKAKL
TSVSNNVVVD GSISYIDDNP PEGNSDAASG NPEGGTTMSS YSVKIALANL DKVKNGYHMQ
ATIDLGDLGA IELPKKAIQK EGEQAYVLVN DFGTIIRRDV QVGQENGDKM AIESGLESAD
RVVISSKKPV KVGDIVESDA AIASDESATN ESMTDASK

EF102-1 (SEQ ID NO:393) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TTACTCAAAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTGGT CCTACGATTG CTTTTGCCGA TGAAATTACT
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGTCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTTAGTA TTCCAACAGA AGTGACGCAC 
GGTTATCAGA AAAACCCTTT GCCATCAATG TCTGATAAAG CGAAACTAGT ATCGGTTCTT 
TGGGAAAAGG CTGGAACAGA TATTGATACA AATATGGTTG CACAAAAGAT GATTTGGGAA 
GAAGTGAACG GTTATAAACT CCATTCCATA AAAAGATTAG GTGGTGCTTC AGTTGATATA 
AAATCTATTG AAGGAAAAAT TAATAAGGCA ATTGAGGAGT ATCAAAAAAA ACCAAGTTTT
CATAATACCA CTGTAAAAAC AATTTTAGGT CAATCGACAA CTTTAATAGA TAAAAATGAA 
TTAAATTTAT CTGAGTTTGA TAAAGTCGTC CAAAATACGG CGAATATAGA TTACCGTGTA 
ATTGGGAATC AATTAGTGCT TACTCCAAAC TCTAATTCCA AATCAGGAAC ATTAACATTG 
AAAAAATCAG CTGGTACTGG AACTCCAGTC GCTTATAAAA AAGCAGGACT TCAAACTGTG 
ATGGCTGGTG CGCTTGATAA GCCCAATACC TACGCTATTA AAATTAATGT GGAAACTAAG 
GGTTCTTTAA AGATCAAAAA AATCGATAAA GAATCAGGTG ATATTGTACC AGAAACGGTT 
TTCCATTTAG ATTTTGGGAA AGCTTTACCT TCAAAAGATG TGACAACAGA TAAAGATGGG 
ATTTCTATTT TGGATGGAAT TCCCCATGGT ACAAAGGTAA CTATTAGTGA AAAATCGGTG 
CCAGATCCTT ATATGATTGA TACCACACCC ATGGCTGCCA CCATTAAAGC GGGCGAGACC 
ATTTCCATGA CTTCGAAAAA TATGCGACAA AAAGGTCAAA TTCTTTTAGA GAAGACTGGG 
GTAGAAACAG GTACTGATCT TTGGAATGAC AATTATTCTC TAGCTGGAAA TACATTTGCC 
ATTCGTAAAG ACAGCCCAGC TGGTGAAATT GTCCAAGAAA TAACAACGGA TGAAAAAGGT 
CGTGCGGAAA CACCAAAAGA GCTTGCTAAT GCTTTGGAAC TGGGAACCTA TTACGTGACA 
GAAACTAAAT CTAGTAATGG TTTCGTGAAT ACCTTCAAAC CAACAAAAGT CGAGTTAAAA 
TATGCCAATC AAACCGTGGC TCTTGTTACC AGTAACGTAA AAGGGCAAAA CCAAGAAATT 
ACTGGGGAAA CCACTTTGAC AAAAGAAGAC AAAGATACCG GTAATGAGAG TCAAGGGAAA
GCTGAGTTTA AAGGAGCTGA ATATACTCTC TTTACTGCAA AAGATGGTCA AGCTGTTAAA 
TGGAGTGAAG CTTTTAAAAC AGAATTAGTG AAGGGAACGA AAGCTTCTGA TGAAACAGTG 
ACTTTGGCTT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT CGCCATTGGA AGGGACCAAN 
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGACG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCAGCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA CATTCCCGCA AACTGGGGAG AAAAATTCCA ACGTTCTACT GTTAGTTGGC 
TTTATCTTGA TTTTTTCGAC TGCTGGGTAT TATTTCTGGA ATCGCCGCAA TTAA 

EF102-2 (SEQ ID NO:394) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVAQKMIWEE VNGYKLHSIK RLGGASVDIK
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL NLSEFDKVVQ NTANIDYRVI
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM AGALDKPNTY AIKINVETKG 
SLKIKKIDKE SGDIVPETVF HLDFGKALPS KDVTTDKDGI SILDGIPHGT KVTITEKSVP
DPYMIDTTPM AATIKAGETI SMTSKNMRQK GQILLEKTGV ETGTDLWNDN YSLAGNTFAI 
RKDSPAGEIV QEITTDEKGR AETPKELANA LELGTYYVTE TKSSNGFVNT FKPTKVELKY 
ANQTVALVTS NVKGQNQEIT GETTLTKEDK DTGNESQGKA EFKGAEYTLF TAKDGQAVKW 
SEAFKTELVK GTKASDETVT LALDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 
IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YVVAQAIDVE ATKAAQEKDE KAKPVVIAET
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VKTFPQTGEK NSNVLLLVGF 
ILIFSTAGYY FWNRRN 

EF102-3 (SEQ ID NO:395) 

TT TAGATGAAAA GAACCAAGTT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT 
TTCTGGCAAG AAACCAAAGC ACCTGAAGGA TATACTTTGG ATGAAACGAA GTATCCTGTA 
TCCATCAAAA AAGTTGATAA TAACGAAAAA AATGCCGTAA TTACTCGAGA TGTTACGGCA 
AAAGAACAAG TTATTCGCTT TGGCTTTGAT TTCTTTAAAT TTGCTGGATC GGCTGATGGC 
ACTGCCGAAA CTGGATTTAA CGACTTATCT TTTAAAGTGT GGCCATTGGA AGGGACCAAN
GAAATCACAG GTGCTGAAGA TAAAGCGACC ACAGCTTGTA ACGAGCAATT AGGTTTTGAT 
GGCTATGGTA AGTTTGAAAA TCTTCCTTAT GGGGATTATT TACTTGAAGA AATAGAGGCT 
CCAGAAGGAT TTCAAAAGAT TACACCACTA GAAATCCGTT CTACATTTAA GGAAAACAAA 
GACGACTATG CGAAGAGTGA GTATGTCTTT ACCATTACCG AAGAAGGACA AAAACAACCA 
ATTAAGATGG TGACCGTTCC TTACGAGAAA CTAACTAACA ACGAGTTTTC TGTTAGTCTG 
AACCGTTTGA TGCTTTATGA TTTGCCCGAG AAAGAAGATA GTTTGACTTC TCTTGCGACT 
TGGAAAGAGG GAAATAAAAA ATTGAATACC CTTGATTTTA CCGAGCTAGT TGATAAATTG 
AGATATAACT TGCATGAAAT CAAAGAAGAC TGGTATGTCG TAGCTCAAGC CATTGATGTG 
GAAGCCACAA AAGCTGCCCA AGAAAAAGAC GAAAAAGCCA AACCGGTGGT GATTGCCGAA 
ACAACCGCAA CGTTGGCGAA CAAAGAGAAA ACTGGAACTT GGAAAATTCT GCATAAATTA 
ACCGCTGAAC AAGTTTTGGA TAAAAGCATC GTCTTGTTCA ATTATGTGTA TGAAAACAAG 
GTAGCCTTTG AAGCAGGCAA TGAGCCAGTA GCGAAGGATG CTAGCTTGAA CAATCAAGCA 
CAAACCGTCA ATTGTACGAT TGAACGCCAT GTTTCCATCC AAACAAAAGC CCACCTAGAA 
GATGGTTCGC AAACTTTTAC TCATGGTGAC GTGATGGATA TGTTTGATGA TGTGTCGGTT 
ACCCATGATG TACTGGATGG CTCAAAAGAA GCTTTCGAAA CAATTCTGTA TGCTTTACTA 
CCAGATGGTA CGAACAAAGA AATTTGGAAA TCTGGCAAAA TTGAGCATGA AGTGAATGAT 
AAAGAATTTA CCAAAACCGT ACTTGCGGAA AAAGTAGATA CCGGAAAGTA TCCAGAAGGA 
ACTAAGTTTA CTTTTACGGA AATCAATTAC GAAAAAGATG GAAACGTGAA TGGAAAACAC 
AATGAAGATT TGAAAGAAAA ATCTCAAACC TTAACACCAA AAGAAGTGCC AACCATACCG 
AGTACGCCAA AACAACCGGA AACACCAGCT GTTCCAAGTA ATTCTCAAGA ATCTAGTCCC 
ACAGTGAAGA 

EF102-4 (SEQ ID NO:396) 

LDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS 

IKKVDNNEKN AVITRDVTAK EQVIRFGFDF FKFAGSADGT AETGFNDLSF KVSPLEGTXE 
ITGAEDKATT ACNEQLGFDG YGKFENLPYG DYLLEEIEAP EGFQKITPLE IRSTFKENKD 
DYAKSEYVFT ITEEGQKQPI KMVTVPYEKL TNNEFSVSLN RLMLYDLPEK EDSLTSLATW 
KDGNKKLNTL DFTELVDKLR YNLHEIKEDW YVVAQAIDVE ATKAAQEKDE KAKPVVIAET
TATLANKEKT GTWKILHKLT AEQVLDKSIV LFNYVYENKV AFEAGNEPVA KDASLNNQAQ 
TVNCTIERHV SIQTKAHLED GSQTFTHGDV MDMFDDVSVT HDVLDGSKEA FETILYALLP 
DGTNKEIWKS GKIEHEVNDK EFTKTVLAEK VDTGKYPEGT KFTFTEINYE KDGNVNGKHN 
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK

EF103-1 (SEQ ID NO:397) 

TAAGATAGGT TTATCAAAGA AAAGGAGCGA TGCTTTATGA AAAAGAAAGT ATTAAGTTCG 
ATTACTTTAG TAACATTAAG TACGTTACTT ATAGCAGGTT ATGCAAGTCC AGCATTTGCA 
GATCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC 
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT 
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG 
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT 
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC 
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT 
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT 
GAACCTTACC AAGGCACATA TCATCCAGCC TTTACTTTCT AA 

EF103-2 (SEQ ID NO:398) 

MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG
DEGTFFVTGD GTKRILISRT QNYQAVVDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT AANQNFISLA
KFDPNTSKYE FFNLQTGETR GDFGYFQVVD NNKIRAHVSI GTNRYGAALE LTELNNDRFT
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF 

EF103-3 (SEQ ID NO:399) 

TCATGCAG CCAATCCAAA TAGTGCTACA GCAAATTTAG GCAAACATCA AAACAATGGC 
CAAACAAGAG GCGACAAGGC GACTAAGATT TTATCTGGCA CGGACTGGCA AGGAACCCGT 
GTTTATGATG CTGCTGGTAA TGATTTAACG GCAGAAAATG CTAATTTTAT TGGTTTAGCA 
AAATATGATG GTGAAACCGG TTTTTACGAG TTTTTCGACA AAAATACTGG GGAAACCCGT 
GGTGACGAAG GAACATTTTT TGTGACAGGT GATGGCACAA AACGAATTTT AATTTCGCGG 
ACACAAAATT ATCAAGCCGT AGTGGATTTA ACCGAAGTGA GTAAAGACNA ATTTACTTAC 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT GATGTTGAAG TTTACGTGGA ACACATCCCT 
TATCATGGGA AAAAATTAGC TTTTACAAAT GGACGTGAAG CATTAACCAA TCAAACTGGC 
AAAATTGTGA CAAATAAATC AGGGGATAAA ATTTTAGGAA CAACCTTGTG GAATGGCACA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG ACAGCGGCCA ATCAAAATTT CATTAGTTTA 
GCGAAATTTG ATCCAAACAC AAGTAAATAT GAATTTTTCA ATTTACAAAC AGGTGAAACC 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG GACAATAACA AGATTCGGGC CCATGTATCT 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA GAATTAACGG AACTAAACAA TGATCGATTT 
ACGTATACTC GAATGGGTAA AGATAATGCT GGTAATGATA TTCAAGTGTT CGTGGAACAT 
GAACCTTACC AAGGCACATA TCATCCAGCC T 



EF103-4 (SEQ ID NO:400) 

HAANPNSATA NLGKHQNNGQ

TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YDGETGFYEF FDKNTGETRG
DEGTFFVTGD GTKRILISRT QNYQAVVDLT EVSKDXFTYK RLGKDKLGND VEVYVEHIPY
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT AANQNFISLA
KFDPNTSKYE FFNLQTGETR GDFGYFQVVD NNKIRAHVSI GTNRYGAALE LTELNNDRFT
YTRMGKDNAG NDIQVFVEHE PYQGTYHPA

EF104-1 (SEQ ID NO:401)



TGAAAGGGGA TTAGTATGAA GAAAAAAACT TTTTCTTTTG TGATGTTGAG TATACTTCTC
GCACAAAATT TCGGGTTTGC CGTAAATGCC TATGCTGTAA CAACGACAGA AGCACAAACA
GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGGTACAC CATCTGAACA ACAAACAACG
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTGAAGAAAT TAAGAATCCT
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC
ACTACGCCAA TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA
TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA 
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA 
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT 
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC 
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTCGAAAAA ATCACTGTT 

EF104-2 (SEQ ID NO:402) 

MKKKTF SFVMLSILLA QNFGFAVNAY AVTTTEAQTE TTDTAKKEAE LSNSTPSLPL 
ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK 
TGQDVPFEKI TV 

EF104-3 (SEQ ID NO:403)

TGTAA CAACGACAGA AGCACAAACA 

GAGACCACTG ATACAGCAAA AAAAGAGGCA GAGTTATCGA ACTCAACACC ATCTTTACCT 
TTAGCAACAA CGACTACTTC AGAAATGAAT CAACCAACTG CAACAACTGA ATCGCAAACC 
ACAGAGGCGA GCACAACAGC TTCCAGTGAT GCTGCTACAC CATCTGAACA ACAAACAACG
GAGGACAAGG ACACCTCACT TAATGAAAAA GCCCTGCCAG ATGTTCAAGC GCCAATTACA 
GATGAACTAC TTGACAGTAT GAGTCTTGCG CCGATTGGTG GAACAGAATA CAGCCAAACA 
GAGGTTCACC GCGAATTAAA TACAACACCG GTAACCGCTA CGTTCCAATT TGCTGTTGGA 
AACACAGGTT ATGCACCTGG ATCAGTTTAT ACAGTTCAAT TACCAGAACA TTTAGGTTAT 
TCAACTGTCA GCGGAGAAGT GACAGGCATT GGCGCAACTT GGGCAGTCGA TGCGGCGACC 
AAAACATTAA GTATTACGTT TAATCAACGA GTTTCAGATA CTTCCTTTAA AGTAGAACTA 
AAAAGTTATC TAACAACAGA GGCGGAACCA TTAATCAAAA TTGAAACTCC AGGAAAAAAT
AAAAAAACCT ACTCGTTTGA TTTATATGAA CAAGTGGAAC CAATTCAATA TAACGAACGA 
ACCAGAACGA CGGGGTTAGA TGGCGAAATT TTTTATAATT TAGACCGGAC GTTAACTGGC 
AATCAAACAT TAGAATTATT AACAACAGAG ACGCCAGGCG CTGTCTTTGG AAAACAAGAT 
AACTTGGAAC CTCAAGTTTT CAGTTACGAT GTCGACATTA ATGGTCAAAT TTTACCAGAA 
ACGCAAACCT TGTTAACACC TGGCAAAGAT TATACATTAA GCGATAATTC ACTCGGGCGG 
ATTGCTGTAA CTGTTCCAAA CATGAATCAA CAAAAAGCCT ATTCCTTATC GATTAATCGG 
ACAATTTATT TAGAGAGTGC TTCGGACTAT AACTACTTAT ATTCGCAGCA GTATCCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT ACGACAGGAA CTAAACAAAC AACCGATTTT 
ACTGCTAAGA CGAGTCAAAC AAGTAAAGTA ATTGCTGATC GTGAAATGCG TAGTATGTCC 
TATATCAGTT TTCAAAGCAA AGGGAAATAT TATGTAACAA TTTATGGCAC GTTAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGTACAAACG GTCAAGAAAT TAAGAATCCT 
AAATTTACGG CGTATGGTCC TTTATATGAA AATGTAAAAT TGGAAGACTA TTTTGATATT 
AAAACTGAAG GTGGCAAGCT CACTTTAACG GCCACAAAAG ATAGCTATTT AAGAATAAAT 
ATTTCTGATT TAACAATGGA TTTTGACAAG AAGGACATTA ATCTATCATT AAGTACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA GTATCCGATC AATATATTGA ACCAATTAGT 
GTTGTTAATC CTTTGAATGC TGAAACTGCT TGGGGTAATT ATGATCAAAA TGGTGCCTAT 
TCATCAAGAA CAACTGTCTC AGTTATGGGA AGCAAAGAGA AACCGATTCA AAATTTAGAA 
ATTAAAGTAA AGCATCCTAA TTATCTTTCA TTACGAGCTA CAAAAGAAAT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG CCAACGTCAG ATGGTTCAGT TATTAAGTTC 
ACTACGCCAA TAACCAACGA AATCCAAATT CCAATTGGTT TTAATTATGT GCCAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGATACCGA TAACAATGAG TGCTGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTACT AATAGTAAGC GTGGTTCTGA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCACGAAATG ATTCTTTTGA CTCACTAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCCGAT GTTCTTTTTG ACATTTATGA TGTTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAC TGGGACCGCG GTCAATACTT TGATAAACCA
ATGACGCCAA ACAGCCCTGG ATATCCAACG ATTACTTTTG ACGAAAATAC CAATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT TACATTATTG AGTATAAAAA CGCCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAACA GGGACAGCGA AAGAACCACA ATCGAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTCAA AATGAAGCGT TAGACATTTT GAGTGCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA ACAAAAACGA CAGTAACAAC AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA ACGATTGAAT TAACACCAAA AGGCACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTACC GTGAAAGGCG TGCCAGAAGA TGCTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC ATTTTTAAAG ACTATACATT GACAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGCTGGCC AAATCTATAC AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT GCTAGCAAGA AAAAAGTCAC CACTGCGCCA
ATCACATTGA AATTCTCAGA AGGTGATGCG GAAGGTATTG TTTATTTAGC AACTGCCACA 
TTCTACACGC ATAACGTAGA GGATGAAAAC CAAGCAATTG CGAAGGTTTC TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA TTTACAACAG ATGAAAAAGG TCAATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT TTGCGAGTAA CGAATGTACC GCAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GCCATTAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT CACAGTCGTT TACAAGTCAA AGATTCAACG 
ATTTATGTCG GCGATTCATG GAAACCAGAA GAGAACTTTG TTTCAGCAAC AGATAAAACA
GGTCAAGACG TTCCCTTCGA AAAAATCACT GTTTCAGGTC AAGTTGATAA CANCAAAGCA 
GGCGTTTATC CAATTATTTA CAGTGACGAA GGTAAAGAAG AAACAGCCTA TGTGACCGTC 
AAACCCGACC AATCTAAGTT AGAGGTCAAA GATACAACGA TTTATGTTGG TGATTCGTGG 
AAACCAGAAG ATAATTTCGT TTCAGCGACA GACAAAACAG GTCAAGACGT NCCGTTTGAA 
AAAATTGATG TTCAGGGAAC AGTGAATGTT GATAAAATAG GCGATTATGA AATTGTCTAT 
AAAAATGGCA NAAAAGAAGC GAAAGCAATC GTTCATGTCC GTGATGACAG TCAGTTAGAG 
GTTAAAGATA CAACGATTTA TGTTGGTGAT TCGTGGAAAC CAGAAGATAA TTTCGTTTCA 
GCAACAGACA AAACAGGCCA AGACGTTCCG TTTGAAAAAA TCACTGTTTC AGGTCAAGTT 
GATACTAGCA AAGCAGGCGT TTATCCAATC GTTTACAGTT ACGAAGGTAA AGAAGAAACA 
GCTAATGTGA CTGTCAAACC CGACCAATCT AAGTTAGAGG TTAAAGATAC AACGATTTAT 
GTGGGCGATA AATGGGAACC AGAAGATAAT TTCGTTTCAG CAACAGACAA AACAGGTCAA 
GATGTCCCGT TTGAAAAAAT TGACGTTCAG GGAACAGTGA ATGTTGATAA AATAGGCGAT 
TATGAAATTG TCTATAAAAA TGGCACAAAA GAAGCGAAAG CAATCGTTCA TGTCCGTGAT 
GACAGTCAGT TAGAGGTCAA AGATACAACA ATTTATGTGG GTGATAAATG GGAAGCAGAA
GATAACTTCG TTTCCGCGAC AGACAAAACA GGTCAAGACG TTCCGTTTGA AAAAATTGAT
GTTCAGGGAA CAGTGAATGT TGATAAAATA GGCGATTATG AAATTGTCTA TAAAAATGGC
ACAAAAGAAG CGAAAGCAAT CGTTCATGTC CGTGATGATA GTCGTTTACA AGTCAAGGAT 
ACAACGATTT ATGTCGGCGA TTCNTGGANA CCAGAAGNGA ACTTTGTTTC AGCNACAGAT 
AAAACAGGTC AAGATGTCCC ATTC 

EF104-4 (SEQ ID NO:404) 

VTTTEAQTE TTDTAKKEAE LSNSTPSLPL 

ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPSEQQTTE DKDTSLNEKA LPDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFQFAVGN TGYAPGSVYT VQLPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV SDTSFKVELK SYLTTEAEPL IKIETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YNLDRTLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TLSDNSLGRI AVTVPNMNQQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKQTTDFT AKTSQTSKVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES TNGQEIKNPK FTAYGPLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK DINLSLSTPV IGPNKAIQLV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS KEKPIQNLEI KVKHPNYLSL RATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP IGFNYVPDSL PKDKSIPVDT IPITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA RNDSFDSLSV RTKIPAGADV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TFDENTNSYT FDFGKTNKRY IIEYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANPTLKNVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGVPEDAYSL EKTTNGAKVI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF TTDEKGQYSF DAIMTGDYTL RVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SRLQVKDSTI YVGDSWKPEE NFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG KEETAYVTVK PDQSKLEVKD TTIYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KIGDYEIVYK NGXKEAKAIV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EKITVSGQVD TSKAGVYPIV YSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF VSATDKTGQD VPFEKIDVQG TVNVDKIGDY
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI YVGDKWEAED NFVSATDKTG QDVPFEKIDV
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR DDSRLQVKDT TIYVGDSWXP EXNFVSATDK
TGQDVPF 

EF105-1 (SEQ ID NO:405) 

TAAATGAAAA AAACAGTCGT CTACTCCTTG TTATTCGGAA CAATGTTGCT TGGCGCCACT 
GTTCCTGCTG AAGCGGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 
AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT ACAATATAAA ACAACATTGA CTTGGCTACT TTCAGATGTA 
CCAGTAAATA ATGGAGGGAA ATAA 



EF105-2 (SEQ ID NO:406) 

MKKTVVYSLL FGTMLLGATV PAEAATVVFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP
VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA
NIQLDAAGAE TVVMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK
TPKDAVQYKT TLTWLLSDVP VNNGGK

EF105-3 (SEQ ID NO:407)

GGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 

AGCACAGATG GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
CCAGTCGATC CAACGAATCC TGATGGACCT AATCCAGGTA CCCCTGGTCC ACTTTCCATC 
GATTATGCCT CAAGTTTGGA TTTTGGGAGT AATGAGATAT CGAATAAGGA TCAAACGTAT 
TTTGCCAGAG CGCAAACCTA TAGAAATCCA GATGGTTCAG CAAGTGAATT GGCAACTGCT 
AATTATGTAC AAGTAAGTGA TTTACGGGGA ACCAATGCTG GCTGGGTTTT AAAAGTGAAA 
CAAAATGGTC AATTTCGTAA TGCAGAAACA TTACACAAAG AATTAACAGG CGCCACCGTC 
GCCTTTACTG AGCCCAGTGT TCGCTCAAAT GCGACGGACG TATTGCCGCC AACTGCTACC 
GCAAACATTC AATTAGATGC TGCGGGCGCA GAAACTGTTG TCATGCAAGC CCCAGAAAAG 
ACCGGCGCCG GAACGTGGAT CACGCTGTGG GGGCAAGCAG AAAAAGTGAC CGAAAAAAAT 
CAACAAGGAC AGCAAGTAAA TGCCACAATC ACACGGGCAA TCTCACTAAC TGTTCCTGGG 
AAAACCCCTA AGGATGCAGT AC 

EF105-4 (SEQ ID NO:408) 

ATVVFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP

VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF ARAQTYRNPD GSASELATAN 
YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA FTEPSVRSNA TDVLPPTATA 
NIQLDAAGAE TVVMQAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISLTVPGK
TPKDAV 

EF106-1 (SEQ ID NO:409) 

TAGTCGTTTA TGAAGAAAAA AATCGTTGGT ACAATTACGT TGTTGGCTTT AAGTGCGTTA 
TTAGTTGGTG GAGCAGGAGG GGCTTTGACG GCAGAAGCAT ACGTTCCTCA AAGCGTAGAC 
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA

EF106-2 (SEQ ID NO:410)

MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPEY LRSVGIRQDE
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKVVD
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF

EF106-3 (SEQ ID NO:411)

AT ACGTTCCTCA AAGCGTAGAC
AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT
GAAGGATTAT CAGAAAAAGA TTGGGCTGGA ACACGCGTTT ATGATCGAAA TGGGAATGAC
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC
TATGAATTTT TTGATAAAGA GACTGGAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAACCAA AAATTATCAA
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT
ATTGGTAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGCAC AAAAGTAGTG
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC
TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC
CAAGGAACTT

EF106-4 (SEQ ID NO:412)

YVPQSVDN PNNLGDLPEY LRSVGIRQDE
GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKVVD
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GT



EF107-1 (SEQ ID NO:413) 

TAAAAAACGG CACTCAATAT GTCAAAATTT GAAATTTCAA GCTGTGTGTT CTTTGGTAAA
ATANATANAA AAATGCTAGT TATCAGTATC GATAATAACA GGATACTGAT TAAGAAAGGA
CTTTATAGAG ACTATAGATT GAATTTTTAC ATAGAAAGAA GGAGCAAGAT GAAGCGAGTA
AATTGGAAAA GATGGCTAGT TGTTGGGTTA AGTTGTTCTT TGTTCATGGA TTCAGTGGTT
GGTGTGACTG TGTTAGCGGA AACGATTACT GGGGCGACGG AGCAAGGAGT AGCAACATCT
CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCTAA 



EF107-2 (SEQ ID NO:414) 
MKRVN WKRWLVVGLS CSLFMDSVVG VTVLAETITG ATEQGVATSQ SSDEASQTTQ TTEESQATVA
SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE
KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD
DNDDLHVVYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLVVDSENYV
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG
KTSTVTLTAD NTATAVVANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL
VTGP

EF107-3 (SEQ ID NO:415) 
GG AGCAAGGAGT AGCAACATCT 

CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTCACA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGCTTC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG GCGCGAGATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATCCAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGCCTGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTTTT TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGCGTTAC GTATGCGCTG ACCTATTTTT ATGGAGATGT CGGCGGTCCA
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTATTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGCC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTATGC GGTGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAAC CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGGGA AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGCCC GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAACTAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATCGA AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCAAAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGACACT TTACTCAATG 
AGTATCTATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCTGA TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGACAG TGAAAACTAT
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGAAAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGAC AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACCA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAACCAC CATTAAAATC 
AAGCCAACGG CAACTTGGGC GGCTGGCATC GGCGCACCCA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 




GTGTCCATCA CGTTGGATCA GCCTTTACCA GCTGGCGGTC AATTAAAAAT GAACTTATTA 
GGAACCGCCG TTACAGGAAA TCCTGGTCAA GTTTTAACCG CTGATGTTGA AGTAACGGGC 
AACTTTGGCA GTTTAACTGC CAAAGATACG GTCCGTATTA AAGACTTAGA TCAAGAAATT 
ACGAGTCCTG ACGGCGACGG CTTTATTAGT ACCCCGACAT TTGATTTTGG TAAACTAGCA 
ATTTCAGGAA GTAAGCAACA ATATGGTTTG AAGAAGGCCG CAGATTACTA CGGCAATGGC 
ACTCGCAACC CTTATTTACG CCTGAATACT AGCCAAGCCA ATTGGAGTTT AACGGCCCAG 
CTATCGCAAC CAAAATCAGC CACAGACAGC TTGCCAACAA CGACCCGCTT GTTGCTAGGA 
ACGGCCGCTG CTGCCAGCTT TACCGATTAC AACCAACCAA CAGAAACCAG GACACCACTT 
GGCAAGACCA GCACCGTGAC TTTAACCGCC GACAATACCG CAACAGCGGT GGTCGCAAAC 
CAACAGTTCA CAGGCAGTGA CGTCTATCAG TTGGACTTCA CGTTTGCTAA CATCAAACTA 
GAAGTGCCAG CCAACCAAGG TATGGCTGGC CAACAATACC AAGCCGCCGT CACGTGGAAT 
TTAGTGACTG GCCCCT 

EF107-4 (SEQ ID NO:416) 

EQGVATSQ SSDEASQTTQ TTEESQATVA 

SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE 
KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTNE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFENPAG GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL PERYTGSDGK TYLFKGWYKG NAKPSTLETT KTPSYAVTYD 
DNDDLHVVYE EAVMKTYTLP AREALFGYVD EQGNLINPAK FKLSATMGES DGATGEMTTF
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYQTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFTQGN QLVVDSENYV
YTVAKALPKI YQAGEKTYIF QGWFKGKTKP ATLKTTTTPS FTPTFNDEDD MTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIKIK PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV SITLDQPLPA GGQLKMNLLG TAVTGNPGQV LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQQYGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTDYN QPTETRTPLG 
KTSTVTLTAD NTATAVVANQ QFTGSDVYQL DFTFANIKLE VPANQGMAGQ QYQAAVTWNL
VTGP 

EF108-1 (SEQ ID NO:417) 

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGACG ACATGAAGGA AACTAAGTGG
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGCCTT TTCTGGTGTG 
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG 
GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AGGTCCTTAA 

EF108-2 (SEQ ID NO:418) 

MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES SPTAESSAKE 
ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS VVMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTVVYEEFSG YELPASTNQF
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNVVYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEVV VFGGIKDSTV DNFVRIRPND QEVVTPTTEG FISVPTFDFG QVGVAGTKQQ
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN VKLEVPANQG
VKGQQYKAAV TWNLVTGP 

EF108-3 (SEQ ID NO:419)

CGT GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG

GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAGTAC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GCAACGAAAG AAGCGGAGAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAAAAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTG GATTTGTGAA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAGCT ATGCCAAAGG ATCATGGAAG 
ACAGATGGTT ATAATCGAAA GTGGACTAGT ATGGTTCAAG GGAGTGCTTC AACCGTAGGA 
AACTTAAAGA ATGTAATAAT GCCAGCAACT TCTGTAGTTA TGCCACCAGG ACCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGCA TACCAAAATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAGTGC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGAGG AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA AGGTAATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG CCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCGCCAAA ATCACGCCAC CAAGTAATTT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGATCCTTAC ACGTTCCAAC AAAGTGGTTT TTTACCCGAG 
ACCTACAAAG TTGGCACGAA ATCTTACCGA TTCAAAGGCT GGTACAAAGG GAAAACCAAA 
ACCGAGCCTT TGGCCACCAC TAAAACACCT AGCTATAAAG TCACGTATGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAAGT GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGTAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTTGTCC GTGCCTGTCA ACTATTTTGA ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAACG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTTCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCACACATG CCTGTGACCA CTTATAACAA GTTGAAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAATT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGTCA ATTATTATAC AATGAGTGGC
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CACCAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA ACACATTGAC AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGTGGT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAGTAC GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGGCA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAATA TAATAATGGA
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATTAGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAATGT AGCACCAGCA
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCAGA TGGCACCTAT
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAGTGG CACACCAGGA
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAAC CTGCGGATAA GTCAAAACTT
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CGGTGCCAAA
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACGAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGAGGG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAACTCG CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTTG GTGGTATTAA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAAG TAGTCACACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC GAATTGGAGC TTAACAGCGC AACTGTCACA ACCAAAATCA
GCGACAGACA GCTTGCCTAC AGCGACCCGC TTATTATTAG GGGCGGCGCC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACCAC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAATT CACAGGTAGT
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGTGCC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGGCCGCA GTTACATGGA ACCTAGTTAC AG 

EF108-4 (SEQ ID NO: 420) 

VTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTTDTSQEEA TKEAEKEEPQ AEVEQAETPI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINPDIIPIT YSYAKGSWKT DGYNRKWTSM VQGSASTVGN 
LKNVIMPATS VVMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGKID STYPLPAIAL
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLYRI NDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVDTSGAKI TPPSNFTQGK QTVINSDPYT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT EPLATTKTPS YKVTYDDNDD LTVVYEEFSG YELPASTNQF
GFVDEATNKL IAPDQVQMKY NLTLNENNKK TVMSSNLTGT DTATLKNLSV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI NITKSDGKTD PAFPLPKIYN IDQVEMSHMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTESGIDD PVNYYTMSGP VYYYLENRKV TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKQAGTLP DTYTTGGKTY KFKGWYKGKS ILNTLTTTKA 
PSYQVTYDDN DDLNVVYEEE TVTTVYPSVD MNFVNEKGGA FTPALTFSGK YYAQSTSAYL
RTDLYDVTSK NNGNGQYTVS INNGSMPLSQ ELLKKYNNGQ PISATNRLQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY RYVYTNNSSL VFDPNVAPAE VDLSSESLNL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLVMFLFN AKPADKSKLV YKVTRKQVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAKALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTLTRPK EVIDTNTNVI WTTTITNTSK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKKVS VAFTTRATGK 
PNTVLKAEVV VFGGIKDSTV DNFVRIRPND QEVVTPTTEG FISVPTFDFG QVGVAGTKQQ
HSLKQAADYY GNGTRNPYLR IKKTQPNWSL TAQLSQPKSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTANNTATSI IANKQFTGSN VYQLDFTFNN VKLEVPANQG 
VKGQQYKAAV TWNLVT 



EF109-1 (SEQ ID NO:421) 
AGGAGTAAAT TAATGAAAAA AAGTGTTATA ACTAGTTCTA TGTTAGCGGT TTTGTTGTCG
GGATTTCTCG TTACCCCTAT TTCTGCTTAC GCTTTGGAAC GCTCTAAGGG AACTACTGAA
GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAGAAGTAA CAACTGTTAA TGATACAGAA
NATAGTAGCG ACGTACTGAA ACTACTTTGG NAACATCACN AAGTAATGAG GACACACCTA
TAG



EF109-2 (SEQ ID NO:422) 

MKKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASETSLT ERQMSSGVTE 
EMNPSIINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL 

EF109-3 (SEQ ID NO:423) 

GGAAC GCTCTAAGGG AACTACTGAA 

GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA 
GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACAACGTC CACTTCCTCA 
ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG 

EF109-4 (SEQ ID NO: 424) 

ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EETETTSTSS TSDSTTEVST S

EF110-1 (SEQ ID NO:425) 

TAAATAAAAA TGGATAAGGA GTGGCATAAT CTTATGAAAA AGTTCTCCAT ACGAAAAATT 
AGTGCTGGTT TTTTGTTTCT GATTTTAGTA ACTTTGATCG CCGGTTTTAG CTTGTCTGCA 
AATGCAGAAG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 
GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC 
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGTAA 

EF110-2 (SEQ ID NO:426)

MKKFSIRKIS AGFLFLILVT LIAGFSLSAN AEEYIVPAES HSRQKRSLLD 
PEDRRQEVAD TTEAPFASIG RIISPASKPG YISLGTGFVV GTNTIVTNNH VAESFKNAKV
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV AFSPNADIAV VTVGKQNDRP DGPELGEILT 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
PIYNDQVEVV GVHSNGGIKQ TGNHGQRLNE VNYNFIVNRV NEEENKRLSA VPAA

EF110-3 (SEQ ID NO:427) 

AG AGTATATCGT TCCTGCCGAA AGTCATTCAC GACAAAAAAG ATCGTTACTG 
GACCCTGAGG ACAGAAGACA AGAAGTGGCA GATACAACCG AAGCGCCTTT TGCGTCAATC
GGAAGAATCA TTTCCCCTGC CAGTAAACCA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 
GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 
GTATTAAATC CGAATGCCAA AGATGATGCT TGGTTTTATC CAGGTCGAGA TGGCAGTGCG 
ACACCATTTG GCAAATTCAA AGTGATTGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 
GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGCC CAGAGTTGGG AGAAATTTTA 
ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCACAAT ATCAGGCTAT 
CCAGGTGAGA AAAACCACAC ACAATGGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 
ACAGACTTAG AAAATCCATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 
TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGTGTTC ATTCCAATGG CGGCATTAAG 
CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCAG CAGCGT 

EF110-4 (SEQ ID NO:428)

EYIVPAES HSRQKRSLLD 

PEDRRQEVAD TTEAPFASIG RIISPASKPG YISLGTGFVV GTNTIVTNNH VAESFKNAKV
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV AFSPNADIAV VTVGKQNDRP DGPELGEILT 
PFVLKKFESS DTHVTISGYP GEKNHTQWSH ENDLFTSNFT DLENPLLFYD IDTTGGQSGS 
PIYNDQVEVV GVHSNGGIKQ TGNHGQRLNE VNYNFIVNRV NEEENKRLSA VPAA

EF111-1 (SEQ ID NO:429) 

TGATCAATAC ACTTCGATAC GGTCGCTTTT TTTCTAGAGA AAGTTGAATC TTTCAATAAT 
AAAAAGGGAT ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG 
CTATCATTTT ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT 
TGGTTAGCGT TAGGAATCAC CGTAGTCAGC TGTTTTGCGT TAAGCAGGGA AGTGCAAGCA 
AGTGTTGAAA GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 
ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTTTTTGA AAACAAAAAT AAAGGAAAAG 
TAG 

EF111-2 (SEQ ID NO:430)

MKG LSKKKRVSTW
LALGITVVSC FALSREVQAS VERTKVDEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA
WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK FLKTKIKEK 

EF111-3 (SEQ ID NO:431) 

TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 

ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT ACCTGAAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTGG CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGGAGAAACC TATGATTTAA GCCAAAGCAA CCGCATGGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTGTCAG CAATCGAACG GCGCTTATCC AAACGACACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAGGTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAACCTTAGA TATTGGCACT CGTTTAACTG CTAAAGACAA TGACATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTGATA ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACAAG CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAAGCAA AGATGGCAAG GATATCTAGA TAAAACGTTT
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAATG CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGCGGCA GGTGCCTTTA AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAATA TGCGGGCCTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGATCAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAACCACCAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT CATTAAAGAT AAAAATGGCC ACCTAAAGTG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA CGCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAGTT 

EF111-4 (SEQ ID NO:432) 

DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA 

WHGYYLPEKS NKELLGGFAG PLIIAEEYPV NLAASLNKLT VKNKKTGETY DLSQSNRMDL 
SYYPGRLEQT YELDDLTIHL ALIFVSNRTA LIQTTLENTG EEPLSLGASW TGAVFDKIQE 
GTETLDIGTR LTAKDNDIQV NFGEVRETWN YFATKDTKYT IHHADKVSTK IDNRNYTATA 
EPIELKPKQT YNTYTTESYT FTKEEEAKEQ QQAPEYTKNA ARYFKENKQR WQGYLDKTFD 
QKKTAEFPEY QNALVKSIET INTNWRSAAG AFKHDGIVPS MSYKWFIGMW AWDSWKADVA 
TADFNPELAK NNMRALFDYQ IQKDDTVRPQ DAGAIIDAVF YNQDSARGGE GGNWNERNSK 
PPLAAWAVWH IYQETKDKEF LKEMYPKLVA YHNWWYTNRD HNKNGIAEYG SMVSDAHWQK 
DDKDQIIKDK NGHLKWMMML LLKQPRGKVA WITLHGLTKK VWAKATLELK 

EF117-1 (SEQ ID NO:433)

TAATTCGATG GAGAAGGTGG TTTAGTGAAA AGATTTTCAT TTTTTTTACT AATTTTACTT 
GCTTTAACAG GTTGTAAATC CGGTGAAAAA GAATTTGATG AAGAATCTCT TCAAAATCTA 
AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT 
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAATTA G 

EF117-2 (SEQ ID NO:434) 

VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK ETXQSXSETE LQNGDVRLNE 
YISLKGEIVE SDSRSSLIKK GDRFILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK 
GTLIESEENH DSATN 

EF117-3 (SEQ ID NO:435)

TG AAGAATCTCT TCAAAATCTA 

AAGGAAACGN CACAGTCTTA NTCAGAAACA GAATTACAAA ATGGTGACGT TCGTTTAAAT
GAATATATTT CTTTGAAAGG GGAGATTGTT GAGAGTGACA GTCGTTCCAG TTTAATAAAA
AAAGGTGATC GTTTTATTTT GAAAAGTGGT TCTAGTAAAT ATCAAGTTTN TAATGAGCAA
AAGAAAAAAT TGAAGATTGG TGACGAAGTG ACAGTTTACG GAGAATATTA CGGCTTTTTG
AAAGGGACAT TAATTGAAAG TGAGGAGAAT CATGATTCAG CCACGAA

EF117-4 (SEQ ID NO:436)

EESLQNLK ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK GDRFILKSGS
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN

EF118-1 (SEQ ID NO:437)

TGAGGGGGAA AAAGTGTGTT AAAAAGAAAA GTGGGGATTG TCGCAGGCGT TTTCTGTTCA
GCTTTGTTAC TGACAGGTTG TGGCAAAAGT GCGAAAGATG AGTTCATTCA AGGAATCGGC
AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT
AGATAA

EF118-2 (SEQ ID NO:438)

VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGN XNAQESGVXD FSMSISDMKF
SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR

EF118-3 (SEQ ID NO:439)

GAAAGATG AGTTCATTCA AGGAATCGGC
AATCANAACG CACAAGAATC TGGGGTTTGN GATTTCTCTA TGTCAATTAG TGACATGAAA
TTTTCACAAG AAGATGGTGC ACAAACGAAT CCTATGATTG GGATGCTCAT CACGCAAATC
AAAGACGCAT CGCTTTCTGG GGAAGATTCA AGTAGATGCC AAAAAAGAAA AAGCATTCAA
CTTAGAGATG AAATTAAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT
AGAT

EF118-4 (SEQ ID NO:440)

KDEFIQGIGN XNAQESGVXD FSMSISDMKF SQEDGAQTNP MIGMLITQIK DASLSGEDSS 
RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR 

EF119-1 (SEQ ID NO:441) 

TAAAGAATAC CGAGTAAAAT TTTCGGAAGG CTTTTTTTCA AAAATTGTAT ATGCAAAAGA 
AGTGCAACGG AAAGGAGCTC GGAAATCGTG AATAAGCTAC CTTTACTTAT TTTATTGTTA 
GGCGGAGTGT TGCTTGTTAG TGGCTGTCAA AGCCATAAGG AAGAAAACAA GTCTAGTAAA
GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC 
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TATTAG 



EF119-2 (SEQ ID NO: 442) 

VN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE 
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK 
VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY 

EF119-3 (SEQ ID NO:443) 

AGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGATTGAA ACAGTAGCAA GGGAACAATC GAAGGAATCG 
TTTACGAGTG AAGCAACTAA AAAACAGACA GAAACAACGA AATTAGAAGA ACCAGATCAT 
GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT TTACAAGTAT TAATGATCGC
AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATCA AAAAAAATGG AATTGATGTT 
AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA CGATTTATAA AAATGATCAA 
CATGAATATG CTTTACTTTT GGATTGTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 
TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGACCT ATAATTCAGT TAAGCAAGAG 
TAT 

EF119-4 (SEQ ID NO:444) 

ENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKKNGIDVK TGVALVSVGK
VTTIYKNDQH EYALLLDCEQ NGTQTRVLLL AKVKNNKISE MTYNSVKQEY

EF120-1 (SEQ ID NO:445) 

TGAATAGGCG TGAAAAAGGG AATGTTAGCG TTTTTTGTCG TGCTAGCGGT TTTATCATTA 
ACTGCTTGTC GGGAACCAAA AGNAAAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT

EF120-2 (SEQ ID NO:446)

VKKGMLAF FVVLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET IDKTNEQASS
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 

EF120-3 (SEQ ID NO:447) 

AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAACGA ACAAGCGAGC 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATGAAGAGC CGACAGCTGA TGGAAACAAT 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTGGCGA TTTTACTACT 
TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA TCATTCATCC TGATGGTAGT 
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCGCTAT TATTATATAA AATTGGTGTT 

EF120-4 (SEQ ID NO:448) 

KKV TASTEASSKV EETNEKTSET IDKTNEQASS
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 



EF121-1 (SEQ ID NO:449) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA 
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG 
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 

EF121-2 (SEQ ID NO:450) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEVVKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF VVTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 

EF121-3 (SEQ ID NO:451) 

ACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCG 

EF121-4 (SEQ ID NO:452)

QSSEAV TSTTDSSRKQ
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT 
SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEVVKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN A 



EF122-1 (SEQ ID NO:453) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG TTGAGCTTTA AAAAAGTGAA GTGGGGCATG 
CATTTTTTAA TGGCTGTTGC GTTGATAGCG CCAAGTGTTA CTAGTACGGC ATATGCAGTA 
GAAACAACGA GTCAACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC AAGTAGAAAA 
CAAGAACCAG TCATTACACA GGAAACAACA GACATCAAAC AAGAAGCACC AAATCAGGCT 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA ACCACAGCAC CAACAGAGAC GACGAATTTA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG AGCACGCCGC AAAAAATAAC AATTTTAGGT 
ACGTCAGATG TTCATGGTCA ATTATGGAAT TGGTCTTATG AAGATGATAA AGAACTACCA 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT AACCAAGTCC GGGCACAAAA CCCAGCAGGC 
ACCGTTTTAA TTGATAATGG CGACAATATT CAAGGCACTA TTTTAACAGA TGACTTGTAT
AATAAAGCGC CTTTAGTGAA TGAAAAGACC CATCCAATGA TCACCGCCAT GAATGTGATG 
AAGTATGATG CAATGGTTTT GGGAAATCAT GAGTTTAATT TTGGTTTACC GTTAATCAAA 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
GGTCTTCGTT TTGTTGAAGG GACTACCACG AAGGAACTTG ATTTTAATCA AGATGGGCAG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA ACAATTCCGC ACATTCCTTT GTGGGATGGC 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT TTGAAAGAAG AAGCAGAAAA AGCAGTTACT 
GAGTTGAAAG CTAACGATCA GGCTGACATT ATTGTTGCCT CGATTCATGC GGGACAACAA 
AATAGTGATC CGGCTGCCAG TGCCGACCAA GTAATTGAAA ATGTCGCGGG GATTGATGCG 
TATATTCTGG GTCATGACCA CCTTTCTTTT ACCAAGCAAG GAGCAGCGCC GAATGGAAAA 
ACTGTACCGG TAGGGGGACC GAAAGATACG GGGACAGAAG TTGTCAAAAT TGATCTTTCA 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG CAAGAAGGTA CAGCAACGAT TGTACCAACA 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG GCAGCGACAA AAGAATACCA TGAAAAAACG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA GCAACAGCTG ATTTTTTACC AAAACAAGAA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA CCAACAGCGA TGATTTCTTT AATTAATAAC 
GTTCAAAAAG AAGTAACGGG CGCACAATTA AGTGCGGCAG CGCTGTTTAA ATACGACAGT 
AAATTACCTG CGGGGAAGAT TTCCTATGCC ACGATTTTTG ATATCTACAA ATACCCGAAT 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA AACTTACTGA AGTATTTAGA AAAACAAGGG 
GCGTACTATA ACCAAACACA GCCAGATGAT TTGACCATTA GTTTTAATCC AAACATTCGT 
GTATATAACT ATGACATGAT TTCTGGAGTG GACTACAAGA TTGACATTTC AAAACCAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC GGCCAACCGC TGGATCCTGC CAAAGAATAT 
ACGATTGCTA TGAATAATTA TCGTTACGGC GGTTTAGCTA GCCAAGGGAT TCAAGTAGGG 
GAACCTATTA AAAATTCTGA TCCAGAAACC TTACGAGGAA TGATTGTTGA TTATATTAAG 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA ATCGAACGAA ATTGGTCAAT TATTGGGACA 
AATTTTGATG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA 
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG 
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC 
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TTTTACCTAA AACGGGTACA GAAACAGAAA CGCTTGCATT ATATGGTTTA 
CTGTTCGTTG GACTTTCTTC TTCTGGCTGG TATATTTATA AACGACGTAA CAAAGCTAGT 
TAG 



EF122-2 (SEQ ID NO:454) 

VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSEAV TSTTDSSRKQ 
EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT
SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN
KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFGLPLIKK IQQEATFPIL SANTYNKEDG 
LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE 
LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT 
VPVGGPKDTG TEVVKIDLSV AKNADKWEVQ EGTATIVPTT NVPADEAVKA ATKEYHEKTR
AFIQEEIGTA TADFLPKQEI KGIPEAQLQP TAMISLINNV QKEVTGAQLS AAALFKYDSK 
LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDL TISFNPNIRV 
YNYDMISGVD YKIDISKPVG ERIVDAKIDG QPLDPAKEYT IAMNNYRYGG LASQGIQVGE 
PIKNSDPETL RGMIVDYIKK KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQI 
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF VVTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGFLPKTGTE TETLALYGLL FVGLSSSGWY IYKRRNKAS 



EF122-3 (SEQ ID NO:455)

TG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA
ATTCCGACTT CTCCTGATGG ACGTACACCA AACGCCGCCG CTATTACGAA ACAAGATGTC 
CGTAATGCGG GCTTTGATTT AGATAATGCA TATACCATTA TGCACACAAA TGACGTTCAT 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA GGTATGGCGC GTCTAAAAAC CTTTAAAGAC 
CAAGAAAACC CAACCTTGAT GGTGGATGCA GGGGATGTTT TCCAAGGATT ACCAATCTCC 
AATTTCTCCA AAGGCGCGGA TATGGCCAAA GCAATGAATG AAGTTGGTTA TGATGCCATG
GCGGTGGGAA ATCACGAGTT TGATTTTGGT TTAGAGATTG CACTAGGTTA TAAAGACCAA 
CTGAATTTTC CGATTTTATC TAGTAATACG TATTACAAAG ATGGCAGTGG ACGGGTTTTT 
GATCCGTATA CAATCGTAGA AAAATCCGGG AAAAAGTTTG CCATTGTAGG TGTGACGACC 
CCAGAAACAG CAACGAAAAC ACACCCGAAA AACGTAGAGA AGGTGACATT TAAAGACCCG 
ATTCCAGAAG TAGAAGCAGT GATTAAGGAA ATTAAAGAGA AGTACGCGGA TATNCAAGCT 
TTCGTGGTTA CTGGGCATTT AGGCGTAGAT GAAACGACGC CGCATATCTG GCGTGGTGAT 
ACGCTAGCAG AAACCCTTAG TCAAACATAT CCTGAGTTAG ATATCACTGT GATTGATGGA 
CATTCGCATA CAGCCGTCGA AAGTGGCAAA CGTTATGGCA AAGTGATCTA TGCTCAAACA 
GGTAATTATT TAAATAATGT TGGGATCGTC ACAGCACCAG AGAGTGAACC AACTAAGAAA
ACAACAAAAT TGATTTCAGC AGCAGAGCTG CTAGAATTGC CAGAAAACCC GGCAGTTAAA 
GCCATCGTTG ATGAAGCACG TACGAATTTT AACGCTGAAA ATGAAAAAGT AATTGTCGAT 
TATATTCCAT TCACATTGGA TGGACAACGA GAAAATGTGC GCACACGAGA GACCAACTTA 
GGGAATTTGA TTGGTGATGC GATTATGTCA TATGGCCAAG ACGCGTTTAG CCAACCTGCT 
GATTTTGCAG TAACTAATGG TGGCGGCATT CGCGCTGATA TTAAACAAGG GCCAATTAAA 
GTTGGGGATG TCATTGCTGT GTTACCTTTT GGCAATAGCA TTGCGCAAAT TCAAGTAACC 
GGCGCCCAAG TTAAAGAAAT GTTTGAAATG TCTGTTCGTT CGATTCCACA AAAAGATGAG 
AATGGCACAA TTTTACTAGA TGATGCTGGC CAACCAAAAC TTGGCGCAAA TGGTGGTTTC 
CTACATGTTT CAAGCTCCAT TCGTATCCAC TATGATTCCA CAAAACCAGG TACTCGCTTG
GCTAGTGACG AAGGCAATGA AACAGGACAA ACGATTGTCG GTAGTCGCGT ATTAGGAATA 
GAAATTAAAA ATCGGCAAAC ACAAAAGTTT GAACCATTGG ATGAGAAGAA ACAATACCGG 
ATGGCTACCA ATGATTTCTT AGCTGCTGGT GGTGATGGTT ACGATATGCT AGGTGGTGAA 
CGAGAAGAAG GGATTTCACT AGATTCTGTC TTAATTGAAT ACTTGAAAAG TGCAACCAGC 
TTGCGGTTGT ATCGTGCAGC AACGACGATT GATTTAGCAC AATATAAAGA ACCATTCCCA 
GGCGAACGAA TTGTTTCTAT TTCGGAAGAA GCTTACAAAG AGTTAATCGG TGGAGGAGAG 
ACGCCAAAAC CAGATCCAAA ACCAGACCCG AAACCAACAC CAGAAACACC AGTAGCAACC
AATAAACAAA ACCAAGCGGG AGCAAGACAG AGCAATCCAT CCGTAACAGA GAAGAAAAAG 
TATGGCGGCT TT 

EF122-4 (SEQ ID NO:456) 

EKWRAKAI ELVNDGTLQI
PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVHG RLEAGKGELG MARLKTFKDQ 
ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNEVGYDAMA VGNHEFDFGL EIALGYKDQL 
NFPILSSNTY YKDGSGRVFD PYTIVEKSGK KFAIVGVTTP ETATKTHPKN VEKVTFKDPI 
PEVEAVIKEI KEKYADXQAF VVTGHLGVDE TTPHIWRGDT LAETLSQTYP ELDITVIDGH
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA 
IVDEARTNFN AENEKVIVDY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY GQDAFSQPAD 
FAVTNGGGIR ADIKQGPIKV GDVIAVLPFG NSIAQIQVTG AQVKEMFEMS VRSIPQKDEN 
GTILLDDAGQ PKLGANGGFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE 
IKNRQTQKFE PLDEKKQYRM ATNDFLAAGG DGYDMLGGER EEGISLDSVL IEYLKSATSL 
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK PTPETPVATN 
KQNQAGARQS NPSVTEKKKY GGF 



EF123-1 (SEQ ID NO:457) 

TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT
TTGTTACTTG TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF123-2 (SEQ ID NO:458) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD
GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPVVIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK
DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT
PDKSVILEEG KDYTLEVTTD NETGQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN
QVSITGNGSE VVHGDDNGDV VVDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT VVLKAPFINY QGAAKLVKID QQKNALAGAE
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF123-3 (SEQ ID NO:459) 

GGAAGA GGTTAACAGC
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGG 

EF123-4 (SEQ ID NO:460) 

EEVNSD
GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPVVIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK
DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID W 



EF124-1 (SEQ ID NO:461) 

TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT 
TTGTTACTTG TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG 
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG 
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF124-2 (SEQ ID NO:462) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ

KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 

SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 

TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 

PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 

GDWVIDIPTQ EDLPPVVIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT

VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 

YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 

EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK

DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA

SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 

DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 

NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 

WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 

KGGEYHKDDP DHVYWHVMIN GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT
PDKSVILEEG KDYTLEVTTD NETGQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN
QVSITGNGSE VVHGDDNGDV VVDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT VVLKAPFINY QGAAKLVKID QQKNALAGAE
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 

EF124-3 (SEQ ID NO:463) 

TGCCTTCCACATAACTTATACTACCTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG G 



EF124-4 (SEQ ID NO:464) 
AF HITYTTFFDV TELDANNPAL

DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK
KGGEYHKDDP DHVYWHVMIN GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT
PDKSVILEEG KDYTLEVTTD NETGQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN
QVSITGNGSE VVHGDDNGDV VVDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQG 



EF125-1 (SEQ ID NO:465) 

TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCTTCTAT GTGTCGTTAG TAGAGGAAGG 
ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTTGATGTGT 
TTGTTACTTG TTCTAAATTA TGGCACACCA CTCATGGCTT TGGCGGAAGA GGTTAACAGC 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AGCAAGAAAT GACCTTAGCG
CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TGCATTATAG TGCCAATGTG 
TCAATCAAAG CTGCACATTG GGCAGCGCCC AATAATACGC GCAAGATTCA AGTGGATGAC 
CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 
ACGTTGAACC CTACAGCTAC AGAAGATGTG ACGTTTTCTT ATGGACAACA GCAACGAGCG 
TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACGG CAATCACGAG TTCGCCAGCC
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTCCTCTGT TCCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC TGAATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATTATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTATTTAA ATCCTGATGG AACACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TTCAATTACC TGGCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG AAGGCAATGT CTATGGAACC
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATGATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAAGATT TGCCGCCTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTTTGATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAAGATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AACTTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CCAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GGTTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAAC AAACCATTCC AAAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AACCAGATTC TTTACATTTA 
TATTCAGTGA CATTTGATGA CAAAGGAAAT GAAGTCGTTG GAGCAGAACT TGTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTTG CAATTGACTT TTTACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATGGAATTGT CGAAGGCGAT 
GTTGCGGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGC ATTCAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAATT CGTTGGTTGT CAAAGATACA 



ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTTCATGG TAGAAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA TGCGCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGATTGCGGT TAATTTAAGT
AATAATCGTT TAGTCGACGC CTTTTTGACG GATCCAATTT TAACCAATCA AACCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATGGTTCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATTGAATT TAAGACGTCT
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC ACCAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACACACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGAGTACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGACTAAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTTAGCAG GCGCCCATTT CCAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTACAGTAG ATGCCACCGG GGTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTGGAGA CAAAAGCACC AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TTGATGAAGA AACTTCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATGTCAATA AAGTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTTTAAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCCGGATC GAACCAACGC GAATGGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCAGTTCA CAGAAATCGA AGCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCATCGTGA CACAAAATAC GAGCGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGCTGAACT AATTAAAAAA
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAATTTTCAG TCCTTGACAC CACAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCACAGTGAC GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCCAGCAG GGTACCTTTT AAACACTGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CAGCAACAGT TATAGCAACG 
GCTAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC 
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT 
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGTTTAC CGAAAACCAA CACACAAGTC AATTACTTCT TTGTCTTTAT CGGCCTCATG
TTGGTCGGTT TGGCAAGTTG GCTCTTCTAT AAAAAGAGCA AGAAATAA 

EF125-2 (SEQ ID NO:466) 

MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ
KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA 
SANEGSTEEA STNSSVPRSS EETVASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 
TIIDNFEDPI YLNPDGTPAT PPYKEDVTIH WNFNWSIPED VREQMKAGDY FEFQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNERITSES DIHGDFSLDT HLNDSDGRGP 
GDWVIDIPTQ EDLPPVVIPI VPDTEQQIDK QGHFDRTPNP SAITWTVDIN QAMKDQTNPT
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GRELSPDEYT VDKNGNVTIK GDTNKAYRLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PNGLDAEATV TATYGKMLDK RNIDYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDSLHLY SVTFDDKGNE VVGAELVEGK
DYKVVINGDG SFAIDFLHDV TGAVKIDYKT KVDGIVEGDV AVNNRVDVGT GQHSEDDGTA
SQQNIIKNTG AVDYQNSTIG WTLAVNQNNY LMENAVITDT YEPVPGLTMV PNSLVVKDTT
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAKTSDAF HITYTTFFDV TELDANNPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFKPL PAFDLNAQKS GVYNAVTKEI TWTIAVNLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGNT KPDGSVEKVK PTQPLTDITM EEPSEKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YDNTASYTNQ GSSRDVTGKV SIQHGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDVV ITDTPSPNQV LDPESLVIYG TNVTEDGTIT
PDKSVILEEG KDYTLEVTTD NETGQQKIVV KMAHIEAPYY MEYRSLVTSS AAGSTDTVSN
QVSITGNGSE VVHGDDNGDV VVDIDHSGGH ATGTKGKIQL KKTAMDETTI LAGAHFQIWD
QAKTQVLREG TVDATGVITF GGLPQGQYIL VETKAPEGYT VSDELAKGRV ITIDEETSAE 
GAQPTIIKND VNKVFLEKMD EKGKKLVNAR FKLEHAVTTP FTHWEEVPLA PDRTNANGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTQNTSGQI RDVHVKMLNY QGSAELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN GKVTVTDLAP GKYQFVETKA PAGYLLNTEP 
SAFTIAASDR GKPATVIATA NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT VVLKAPFINY QGAAKLVKID QQKNALAGAE
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKTNTQVN YFFVFIGLML 
VGLASWLFYK KSKK 



EF125-3 (SEQ ID NO:467) 
TAACTTTG TTAACTATCA AGGCACGGCT AAATTAATCA AAAAAGATGT GAATGGACAC
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGCGAAGG GAGAAACGAT TCAAACAGGC 
TTGACGACAA ATAATCAAGG GGAAATTGTT GCAGAGCACT TAGCCCCAGG AAAATATCGC 
TTTGTAGAAA CCAAAGCGCC AACAGGCTAT TTATTAAATA CCACGCCAGT CCCATTTGAA 
ATTGCTGAGA AAAATGCTGG TAAACCAGCG GTCGTGGTTG CTAGTGACAA CTTTGTGAGT 
TACAAAGGGG CTTTCCAAAT CGTGAAAACG AATAGCGCAG ACCAACCATT AGCAGGTGCT 
GTTTTTGAAT TATATGATCA CAATAAACAA TCATTAGGGA TTACAGCAAC GAGTGGCAAA 
GATGGCAAAA TTATCTTTAG AGACTTGGCG CCAGGTACCT ATTATTACAA AGAAATCAAA 
GCACCAAAAT TACCAGATGG CGCAGATTAT ATTATTTATC CTGAATTAGT AAAAGTAGAA 
ATTCGTGGTG ATTTCAAAGG TGATCCGGAG ATTTTCCAAT TAGGGGCCTT CGCCAATTTC
AAAGGACGCG CCGTCTTTAA GAAAATTGAT GCCAATGCGA ACCCACTTCC AGGAACGATT 
TTTAAATTGT ATCGAATCGA AAACGGGGAA AAAATCTTTG AAAGAGAAGT AACTGCTGAA 
AAAGATGGTT CATTGGCTAT GGAGGATTTA GGTGCTGGTA GCTATGAATT AGATGAACTG 
GATGCAACGG ATGGCTATAT CGTCAATAAA CAACCCATTT ATTTTGTAGT GAAGAAGAAT 
TCAAATGATA AACAACCACT AGATGAGTTA GAGTTTGTAA ATTATCAAGC AGAAGTAATG 
GGACGTAAAG TCAACGAGCA AGGTCAAACC TTAGCGGGTG CAGTTTTTGC AATTTACAAT 
GCCGATGAGC AGAATCAGCC CCAAGGTTCA CCGATAACAT TCTTGAATCG TGCAGGAGAA 
AAAGTTTCTG AAATAACAAC GGATAAGACT GGCGAAATTT ACGCTAAAGG GCTAAATGAA 
GGGCATTACG TTTTAGTGGA AACGAAAGCA CCAACAGGCT ATCTGTTAGA CACAACGCTA 
CATCCATTTG ATGTAACCGC CCAATTAGGA AAAGAGCAGC CAATTGCTTT AGGCGATCTT 
ATCAATTATC AAGGAACTGC TCAATTAACC AAAGAAAACG AAACAGGTGA AGCATTGGCA 
GGTGCGGTGT TTAAGGTCAT TGATGAAACA GGGCAAACCG TAGATGGACA AACCAATCTG 
ATGTCTGACA AGCAAGGCAA AGTCATTGCG AAAAACTTAG CACCGGGAAC GTATCGTTTT
GTGGAGACAC AAGCGCCAAC TAGCTATCTT CTTAATGAAA CGCCAAGCGC AAGCTTTACG 
ATTGCCAAAG ACAACCAAGG CAAACCAGCC ACTGTGGTAC TTAAAGCACC TTTTATTAAT 
TACCAAGGTG CTGCCAAGCT GGTGAAAATT GATCAGCAAA AGAATGCCTT AGCAGGTGCT 
GAATTTAAAG TGACAGATGC AGAGACAGGG CAAACTGTCG CTCGTTCATT ACGTTCTGAC 
AACCAAGGGT TAGTTCAAGT GAACCACTTA CAACCAGGAA AATATACCTT TGTGGAAACA 
AAAGCACCGG ATGGTTACCA ACTGTCTAAG CAAGCTGTCG CATTCACTAT TGCGGCAACA 
GCGAAAGACA AACCTGAACT CGTGAATGCG GGCACGTTTG TTAACGAGAA ACAACCTGTA 
TCCAAAAAAA CAAAACCAAA TCAGCCAACA ACGAAACAAG CAGCTAGAGA GACAGGTTGG 
CTTGGT 

EF125-4 (SEQ ID NO:468) 

NFVNYQGTAK LIKKDVNGHL LSGATFKVLD AKGETIQTGL 

TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LNTTPVPFEI AEKNAGKPAV VVASDNFVSY
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LGITATSGKD GKIIFRDLAP GTYYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FQLGAFANFK GRAVFKKIDA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AGSYELDELD ATDGYIVNKQ PIYFVVKKNS
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGAVFAIYNA DEQNQPQGSP ITFLNRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDVTAQLGK EQPIALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QTVDGQTNLM SDKQGKVIAK NLAPGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT VVLKAPFINY QGAAKLVKID QQKNALAGAE
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKYTFVETK APDGYQLSKQ AVAFTIAATA
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWLG 

EF126-1 (SEQ ID NO:469) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF126-2 (SEQ ID NO:470) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF126-3 (SEQ ID NO:471) 
TGAA 

GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGAT 

EF126-4 (SEQ ID NO:472) 

EE AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAID 

EF127-1 (SEQ ID NO:473) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG

EF127-2 (SEQ ID NO:474) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF
GITKNKKRKN 



EF127-3 (SEQ ID NO:475) 

GAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 

ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAAT 



EF127-4 (SEQ ID NO:476)

NQG TIAKEFPEAT
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDIN 

EF128-1 (SEQ ID NO:477) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF128-2 (SEQ ID NO:478) 

MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTMNQG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF
GITKNKKRKN 



EF128-3 (SEQ ID NO:479) 
AGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA

CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCAT 

EF128-4 (SEQ ID NO:480) 

DENGK DVTANGKVTQ
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH



EF129-1 (SEQ ID NO:481) 

TGACAAGTGA AGAAACGTCT ATTTGCATCA GTATTACTAT GTTCATTAAC GCTATCAGCA
ATTGCTACCC CAAGCATCGC TTTGGCGGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT
CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT
TACTCAGGTA GCACAATCGT AGGACACTCA GCCTAA



EF129-2 (SEQ ID NO:482) 
VKKRLFASV LLCSLTLSAI ATPSIALADN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE
AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AVVDKKAENE KKVKQLEATE
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR
PVVWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF129-3 (SEQ ID NO:483) 

GGAC AATGTTGATA AAAAAATTGA AGAAAAAAAT 

CAAGAAATTT CATCATTAAA AGCAAAACAA GGGGATTTAG CTTCACAAGT ATCTTCTTTA 
GAAGCAGAAG TATCTTCAGT ATTTGATGAA AGCATGGCTT TACGTGAACA AAAGCAAACA 
CTAAAAGCAA AATCAGAACA ATTACAACAA GAAATTACAA ACTTGAATCA ACGTATTGAA 
AAACGTAACG AAGCAATCAA AAATCAAGCA CGTGATGTTC AAGTTAATGG ACAAAGCACA 
ACAATGCTAG ATGCAGTTTT AGATGCGGAC TCAGTTGCAG ATGCAATCAG CCGTGTTCAA 
GCTGTTTCAA CAATCGTAAG TGCCAACAAC GACTTAATGC AACAACAAAA AGAAGACAAA 
CAAGCCGTTG TTGATAAAAA AGCTGAAAAC GAGAAAAAAG TGAAACAACT TGAAGCAACA
GAAGCTGAAT TAGAAACAAA ACGTCAAGAT TTACTTTCTA AACAATCTGA ATTAAACGTA 
ATGAAAGCTT CATTAGCATT AGAACAATCA TCAGCTGAAA GTTCTAAAGC TGGCTTAGAA 
AAACAAAAAG CAGCTGCTGA AGCAGAGCAA GCACGCTTAG CTGCTGAACA AAAAGCTGCA 
GCTGAAAAAG CCAAACAAGC TGCTGCAAAA CCAGCTAAAG CTGAAGTGAA AGCAGAAGCA 
CCAGTTGCCT CTTCATCAAC AACAGAAGCA CAAGCACCAG CAAGCTCAAG CTCAGCAACT 
GAATCAAGCA CGCAACAAAC AACTGAAACA ACTACACCAA GTACAGATAA TAGTGCAACA 
GAAAATACTG GCTCTTCTTC ATCAGAACAA CCAGTACAAC CTACAACACC AAGCGATAAT 
GGAAATAATG GTGGCCAAAC TGGTGGTGGA ACAGTTACAC CAACACCAGA ACCAACACCA 
GCGCCTTCTG CTGATCCAAC AATCAATGCA TTGAACGTTC TACGTCAATC ATTAGGTTTA 
CGTCCAGTAG TATGGGATGC AGGTTTGGCA GCTTCTGCAA CTGCTCGTGC AGCACAAGTT
GAAGCAGGTG GCATTCCAAA TGATCACTGG TCTCGTGGAG ATGAAGTTAT CGCAATTATG 
TGGGCGCCAG GTAACTCAGT AATCATGGCG TGGTACAATG AAACAAACAT GGTAACAGCT 
TCAGGAAGCG GTCACCGTGA TTGGGAAATT AACCCAGGTA TTACGCGTGT CGGTTTTGGT 
TACTCAGGTA GCACAATCGT AGGACACTCA GCC 

EF129-4 (SEQ ID NO:484) 

DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE 

AEVSSVFDES MALREQKQTL KAKSEQLQQE ITNLNQRIEK RNEAIKNQAR DVQVNGQSTT 
MLDAVLDADS VADAISRVQA VSTIVSANND LMQQQKEDKQ AVVDKKAENE KKVKQLEATE
AELETKRQDL LSKQSELNVM KASLALEQSS AESSKAGLEK QKAAAEAEQA RLAAEQKAAA
EKAKQAAAKP AKAEVKAEAP VASSSTTEAQ APASSSSATE SSTQQTTETT TPSTDNSATE
NTGSSSSEQP VQPTTPSDNG NNGGQTGGGT VTPTPEPTPA PSADPTINAL NVLRQSLGLR
PVVWDAGLAA SATARAAQVE AGGIPNDHWS RGDEVIAIMW APGNSVIMAW YNETNMVTAS
GSGHRDWEIN PGITRVGFGY SGSTIVGHSA 

EF130-1 (SEQ ID NO:485) 

TGATACATTA AAAGGAGGGA AAATATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG 
TTAATCAACA GTTTATTAGT TTTACTATTT ATCATTGGCT TAGCCTTAAT TTTTAACAAT
CAGATACGTA GTTGGGTGGT TCAACAAAAT AGCCGCTCGT ACGCCGTTAG CAAGTTGAAA 
CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
TATGCCTTGG CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA
ACCAAAAAAG ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
AATATGATTA CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GGAACATTAG CAGCAACGAC GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC 
CAATTGGAGC AAAAAACTTT AGCCGATTGG GTGGCTTAA 

EF130-2 (SEQ ID NO:486) 

YIKRRENMRP KEKKRGKNWL INSLLVLLFI IGLALIFNNQ IRSWVVQQNS RSYAVSKLKP
ADVKKNMARE TTFDFDSVES LSTEAVMKAQ FENKNLPVIG AIAIPSVEIN LPIFKGLSNV 
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYITD LSTVYTYKIT 
SVEKIEPTRV ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ 
LEQKTLADWV A 

EF130-3 (SEQ ID NO:487) 

CGTTAG CAAGTTGAAA 

CCAGCTGATG TGAAGAAAAA TATGGCTCGT GAAACAACGT TTGACTTTGA TTCAGTTGAG 
TCCTTGAGCA CAGAAGCGGT GATGAAAGCC CAATTTGAAA ACAAAAACTT ACCTGTGATT 
GGTGCCATTG CGATACCAAG TGTCGAAATT AATTTGCCCA TTTTTAAAGG ATTGTCCAAT 
GTCGCTTTAT TAACTGGTGC CGGGACCATG AAAGAAGATC AAGTCATGGG GAAAAACAAT 
TATGCCTTGG CTAGTCATCG AACGGAAGAT GGCGTTTCCT TATTTTCACC TTTAGAAAGA 
ACCAAAAAAG ACGAACTCAT TTATATCACT GATTTATCTA CTGTTTATAC ATACAAAATA 
ACTTCTGTAG AAAAAATCGA ACCAACCCGT GTTGAGTTAA TTGATGACGT TCCTGGTCAA 
AATATGATTA CCTTAATTAC CTGTGGCGAT TTACAAGCAA CGACGCGAAT TGCTGTTCAA 
GGAACATTAG CAGCAACGAC GCCTATTAAA GACGCCAACG ACGATATGTT GAAGGCTTTC 
CAATTGGAGC AAAAAACTTT AGCCGATTGG GTGGCT 



EF130-4 (SEQ ID NO:488) 
VSKLKP
ADVKKNMARE TTFDFDSVES LSTEAVMKAQ FENKNLPVIG AIAIPSVEIN LPIFKGLSNV
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYITD LSTVYTYKIT
SVEKIEPTRV ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ
LEQKTLADWV A



EF131-1 (SEQ ID NO:489) 

TAGGCGGAGG TAAGCGGTAT GCGTAAACGA CATGCAAAGA AAAGACATGG AGGAGTGAAT 
TGGCTTTTTA TAGTATGTTT GTTGGTGGTG ATTGGTGGTA GTGGTTATTT AATAAAAACG 
TTCTTTTTCA CTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT 
CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG 
CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 
AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 
TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 
GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT
AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT 
GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TTTAA 

EF131-2 (SEQ ID NO:490) 

MRKRH AKKRHGGVNW LFIVCLLVVI GGSGYLIKTF FFTRDSQVSQ ESKVVLEEDR
RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 

EF131-3 (SEQ ID NO:491) 

TTT AATAAAAACG 

TTCTTTTTCA CTAGAGATTC ACAAGTTAGT CAAGAATCGA AAGTGGTCTT GGAAGAAGAT 
CGCCGAAGTG ATAATTATGC GAATTTAACG AAAGAAATAG TTGCACCAGA TAGTGGCGAA 
CTTGATCAAA AAATTCAAGA AACAAATTAT ATTGGTTCGG CTTTGATCAT TAAAGATGAT
CAGGTTTTAG TAAATAAAGG ATATGGCTTT GCCAATTTTG AAAAGCAACA AGCCAACACG
CCAAACACAA GGTTTCAGAT TGGCTCAATT CAAAAATCTT TTACCACAAC CTTGATCTTA 
AAAGCAATTG AAGAAGGTAA ACTTACATTA GATACAAAAC TCGCTACGTT TTATCCGCAA 
ATTCAAGGTG CTGAGGATAT TACGATTAGC GATATGTTGA ATATGACAAG TGGTTTAAAG 
TTATCAGCAA TGCCTAATAA TATCGTTACC GATGAAGAAA TTATTCAATT TGTTAAACAA 
AATACCATTC AAGTCAATAA AGGAAAATAC AATTATTCCC CAGTAAATTT TGTCCTTTTA 
GCAGGAATGT TAGAGAAAAT GTATCAACGT ACCTATCAAG AATTATTTAA TAATCTTTAT 
CACAAAACGG CTGGTTTAAA GAATTTTGGC TTCTATGAAA CCTTATTGGA ACAGCCCAAT 
AATTCAACAA GTTATAAATG GACAGAAGAT AATTCATATA ACCAAGTGCT CTCAATTCCT
GCAGCTAGTT TTGCCCATGA ATTTGGGACT GGTAATGTGG ATATGACGAC AGGTGATTTG 
TATTGGTACT TACATCAATT AACGAGTGGA CATTTAGTTT CCACCGCACT TTTGCAAAAA 
TTATGGACGT CTTCTCAGCA AAGCTCTTAT CATGGCGGCA TCTATGTTCA TGATAATTAT 
TTACGTTTAC ACGGCGTTGA AGCGGGTCAA CAAGCCCTGG TTTTATTTTC AAAAGATATG 
AAGACAGGGG TCATATTGCT AACTAACTGT GTGAATCCAG CGAAATACAA AGAATTAATT 
GGTTCGTTGT TCCATGATGT AACCAATTTA ACTGTTAAAT TT 

EF131-4 (SEQ ID NO:492) 

LIKTF FFTRDSQVSQ ESKVVLEEDR

RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP 
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDITISD MLNMTSGLKL 
SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEKMYQRT YQELFNNLYH 
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFAHEFGTG NVDMTTGDLY 
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF 



EF132-1 (SEQ ID NO:493) 

TAGTTTTCTAATCTCACCAAAACAAAAATTTTTAAGAAAGAAGGAGAGATCGTTATGATGAGAAAATGGAAAGTAGTA
GTGGGAAGTCTGGGAATCTTGATTGCTCTTTTTATATTCGGGGCAT 
TABLE 1. Nucleotide and Amino Acid Sequences of E. faecalis Genes.

GCTTCGAACGAAAAATTAAAGGTAGTAGTTACTAATTCGATTTTAGCAGATATTACTGAAAATATAGCAAAAGATAAA 
ATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGAATATGAACCtTTGCCTGAAGATGTTCAAAAAACT 
TCAAAAGCAGATTTGATTTTTTATAACGGTG 

irATGCGAACAAAGAGGAAAACAAAGACTATTTTGCAGCAAGTGATGGCATAGATGTTATTTACTTAGAGGGTCAGAGT 
GAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAAAACGGTATTATTTACGCTAAAAATATTGAAAAATGG 
TTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATAAAGAAAATCTAGATAAGTATATTGAAAAGTTGGATTCTCTA
GACAAAGAAGCTAAATCTAAATTTGCTTCAATTCCGAATGATAAAAAAATGATTGTTACAA 

TATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGAAAtCAACACTGAAGAAGAAGGAACACCAGATCAA 
ATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTCCTTATTCGTAGAAAGTAGTGTGGACGATAGACCG 
ATGAAAACAGTATCAAAAGATACCAATATTCCTATCTATTCAACGATTTTTACTGATTCAATTGCAGAAAAAGGACAA 
GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAAAATTGCTGAAGGCCTTTCGAAATAA 

EF132-2 (SEQ ID NO:494) 

MMRKWKVVVGSLGMLIALFIFGACSTNSKDKDTVASNEKLKVVVTNSILADITENIAKDKIDLHSIVPIGK
LPEDVQKTSKADLIFYNGVNLXTGGNAWFTKL^ 

YAKNIEKWLAEKDPDl^KFYKENLDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFSKAYNVPSAYIWEINT 
EEEGTPDQIKHLVEKLRTTKVPSLFVESSVDDRPI^WSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMMKWNLDKIAE 
GLSK. 

EF132-3 (SEQ ID NO:495) 

ATGTTCAACAAATAGTAAAGACAAAGATACAGTGGCTTCGAACGAAAAATTAAAGGTAGT^^ 

AGCAGATATTACTGAAAATATAGCAAAAGATAAAATTGATTTACACAGTATCGTACCTATTGGGAAAGATCCCCACGA

ATATGAACCtTTGCCTGAAGATGTTCAAAAAACTTCAAAAGCAGATTTGATTTTTTATAACGGTGTTAACTTGG 

TGGAGGAAATGCTTGGTTTACAAAATTAGTAAAAmATGCGAACAAAGAGGAA 

TGGCATAGATGTTATTTACTTAGAGGGTCAGAGTGAGAAAGGGAAGGAAGATCCCCATGCTTGGTTAAATTTAGAA^ 
CGGTATTATTTACGCTAAAAATATTGAAAAATGGTTAGCGGAAAAAGATCCTGATAATAAAAAATTCTATAAAGAA^ 
TCTAGATAAGTATATTGAAAAGTTGGATTCTCTAGACAAAGAAGCTAAATCTAAATTTGCTTCAATTCCGAATGATAA
AAAAATGATTGTTACAAGTGAAGGATGCTTtAAATATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGG
AAtCAACACTGAAGAAGAAGGAACACCAGATCAAATAAAACACTTAGTTGAAAAATTACGCACAACAAAAGTTCCCTC
CTTATTCGTAGAAAGTAGTGTGGACGATAGACCGATGAAAACAGTATCAAAAGATACCAATATTCCTATCTATTCAAC
GATTTTTACTGATTCAATTGCAGAAAAAGGACAAGATGGTGATAGTTACTA
AATTGCTGAAGGCCTTTCGAAA



EF132-4 (SEQ ID NO:496) 

CSTNSKDKDTVASNEKLKVVVTNSILADITENIAKDKIDLHSIVPIGKDPHEYEPLPEDVQKTSKADLIFYNGVNLXT
GGNAWFTKLVKXANKEENKDYFAASDGIDVIYLEGQSEKGK^ 

LDKYIEKLDSLDKEAKSKFASIPNDKKMIVTSEGCFKYFSKAYNVPSAYIWEINTEEEGTPDQIKHLVEKLRTTKVPS
LFVESSVDDRPMKTVSKDTNIPIYSTIFTDSIAEKGQDGDSYYAMMKWNLDKIAEGLSK 
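
Each polypeptide entry in Table 1 (for example EF130-2) corresponds to the translation of its nucleotide entry (EF130-1). The following is a minimal sketch of that relationship using the standard genetic code. The function name translate, and the choice to report any codon containing an ambiguous base as X, are assumptions made for illustration and are not part of the listing itself.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon from the first base; stop at the first stop codon."""
    # Remove the spaces used for 10-base grouping and normalise case.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        # Assumption: any codon not in the table (e.g. containing N) becomes X.
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start of the EF130-1 coding region as listed, beginning at the ATG.
ef130_start = "ATGCGC CCAAAAGAGA AAAAAAGAGG AAAAAATTGG"
print(translate(ef130_start))  # MRPKEKKRGKNW
```

The printed residues agree with the corresponding stretch of EF130-2 (MRP KEKKRGKNW). A comparison of this kind is one way to check a listed polypeptide against its nucleotide sequence where the text has been damaged in reproduction.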
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases

4.00E-32     lipoprotein [Bacillus subtilis]
1.40E-31     hypothetical protein [Bacillus subtilis] >gnl|PID|e1182900
6.80E-22     similar to hypothetical proteins [Bacillus subtilis]
3.10E-98     ferric anguibactin-binding protein precursor FatB of V.
3.10E-98     ferric anguibactin-binding protein precursor FatB of V.
1.30E-89     ceuE gene product [Campylobacter coli]
1.30E-89     ceuE gene product [Campylobacter coli]
2.80E-52     40 kDa protein [Plasmid pJM1] >pir|A29928|A29928 membrane-associated
2.80E-52     40 kDa protein [Plasmid pJM1] >pir|A29928|A29928 membrane-associated
8.70E-116    pheromone binding protein [Plasmid pCF10] >pir|B53309|B53309
1.10E-109    traC [Plasmid pAD1] >pir|A53310|A53310 pheromone cAD1 binding
3.60E-103    TRAC [Enterococcus faecalis]
2.30E-102    TraC [Enterococcus faecalis]
1.90E-67     threonine kinase [Streptococcus equisimilis] >pir|S28153|S28153
1.70E-46     dciAE [Bacillus subtilis]
TABLE 3. Conservative Amino Acid Substitutions.

Aromatic       Phenylalanine
               Tryptophan
               Tyrosine

Hydrophobic    Leucine
               Isoleucine
               Valine

Polar          Glutamine
               Asparagine

Basic          Arginine
               Lysine
               Histidine

Acidic         Aspartic Acid
               Glutamic Acid

Small          Alanine
               Serine
               Threonine
               Methionine
               Glycine
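
The groupings of Table 3 can be expressed as a simple lookup. The sketch below is illustrative only: the dictionary CONSERVATIVE_GROUPS and the function is_conservative are hypothetical names, the one-letter codes are the standard abbreviations for the residues named in the table, and a substitution is treated as conservative when both residues fall in the same group.

```python
# Groups transcribed from Table 3, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = {
    "Aromatic": "FWY",      # Phenylalanine, Tryptophan, Tyrosine
    "Hydrophobic": "LIV",   # Leucine, Isoleucine, Valine
    "Polar": "QN",          # Glutamine, Asparagine
    "Basic": "RKH",         # Arginine, Lysine, Histidine
    "Acidic": "DE",         # Aspartic Acid, Glutamic Acid
    "Small": "ASTMG",       # Alanine, Serine, Threonine, Methionine, Glycine
}


def is_conservative(a: str, b: str) -> bool:
    """Return True when residues a and b belong to the same Table 3 group."""
    a, b = a.upper(), b.upper()
    return any(a in members and b in members
               for members in CONSERVATIVE_GROUPS.values())


print(is_conservative("L", "I"))  # True: both Hydrophobic
print(is_conservative("D", "K"))  # False: Acidic versus Basic
```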
Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.



EF001-2 


from about Asp-150 to about Lys-152, from about Ser-256 to about Tyr-
Z3y, from about Lys-oou to about Lys-joi, from about Asn-4Uo to about






EF002-2


from nhnnt A en -8 0 in siKmif Acrv-R^ from siKmit" Acn 9R1 ir\ ckY>r\iii (~x\\r- 
li will auuui A5]J-OU l\J aUUUl n.op*Oj, liUIli aUUUl /\op-^0 1 UU dUUUl vjiy *■ 

283. 






EF003-2 


from flhnnl Asn-^n^ in aHmit filv-966 

H Will CHJVJUl /\i)U a»U J IVJ aUUUl VJIV Z,uu. 






EF004-2 


frnm aKmit Acn-9^ tn nKmit Acn-9A frm-n ahmi1 T \/o Sl'X in nVimif Qr>r Q7 
llUill aUUUL /-IMl-^J IU oUUUL /\bIl*-ZrU, ilUIIl dDUUL Lya OJ 10 dUUUl uCl"0 /, 

from ahnut Tvr-1^4 tn ahmit A°.n.-1SQ 

ll Will O-L/Wtit 1 VI i w/ i tlUDUv Aol/ 1 J/i 






EF005-2


from about Lys-249 to about Glu-252. 






EF006-2


irom aooui uiy-zj to aoout ASp-Zo. 






EF008-2


irom aoout inr-yz to aoout \jiy-y4, irom aoout rro-ioi to aoout Asp- 
1 65, from about Gly-287 to about Thr-289. 






EF010-2


from about Pro-129 to about Asn-131.






EF012-2 


from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, 
from about Asp-256 to about Thr-258, from about Glu-461 to about Asn- 

40o. 






EF013-2 


from about Thr-30 to about Asp-32, from about Glu-73 to about Ala-75, 
rrom aoout uin-io4 to aoout Asn-loo, rrom about Lys-iy3 to about Uiy- 
1 yj. 






EF014-2


rrom aooui oor-zu^ 10 aoout ASp-zuo, rrom about uin-j 14 to about uiy- 
316 






EF015-2 


from about Pro-66 to about Gly-69. 






EF016-2


irom about Lys-Zio to about Asn-239. 






EF017-2 


from about Ser-90 to about Gly-93, from about Thr-197 to about Lys-
199, from about Lys-230 to about Asn-233, from about Ser-428 to about
Gly-431.






EF018-2 


from about Lys-159 to about Tyr-161, from about Asn-165 to about Ser- 
167, from about Asn-250 to about Arg-256, from about Asn-392 to about 
Gly-395, from about Lys-416 to about Tyr-418, from about Asn-428 to 
about Arg-430.






EF019-2 


from about Arg-209 to about Ser-211, from about Lys-287 to about Ser-
290.






EF020-2 


from about Lys-57 to about Asn-62. 






EF021-2 


from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-81,
from about Asp-139 to about Lys-141, from about Glu-255 to about Ser-
258, from about Gln-271 to about Tyr-277.






EF023-2 


from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 
306, from about Thr-453 to about Arg-456, from about Ser-478 to about 
Thr-480. 






EF025-2 


from about Arg-183 to about Asp-185.






EF026-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gln-107 to about Asn-110.






EF027-2 


from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 
231. 






EF028-2 


from about Asp-186 to about Gln-188. 






EF029-2 


from about Asp-118 to about Lys-122, from about Asp-124 to about
Tyr-126. 






EF031-2 


from about Glu-30 to about Gly-33.






EF034-2 


from about Glu-25 to about Gly-27, from about Glu-75 to about Thr-77.










EF036-2


from about Gln-177 to about Ser-179.






EF037-2 


from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, 
from about Gln-107 to about Asn-110.






EF038-2 


from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92. 






EF040-2 


from about Lys-167 to about Gly-172, from about Lys-240 to about
Asn-242. 
EF044-2


from about Arg-192 to about Gly-194, from about Asn-200 to about Asn- 
203. 






EF045-2 


from about Asp-159 to about Asn-161, from about His-172 to about Gly-
174, from about Iyr-2ol to about uly-2o4, irom about Lysou:> 10 aoout 

UlU-JOft. 






EF046-2 


from about Ser-18 to about Gly-23, rrom about um-41 to about ber-4 /, 
rrom about lnr-/o to about Asp-/o. ; 






EF047-2 


from about Asn-28 to about Asp-30, from about Asp-273 to about Asn-
277.






EF048-2


trom about Asp-iio to about Lys-141, rrom aoout Asp-ioz to aoout 






EF051-2 


from about Asp-73 to about Gly-76. 






EF053-2 


from about Ser-79 to about Gly-82.






EF055-2 


rrom about Asp-2o to about (jrly-2o, trom about uin-o/ to aoout Asp-oy, 
rrom aoout Arg- / 1 to aoout vjiy-/H, rrom aoout /\rg-o / tu duuut vjiy-o?. 






EF056-2


from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89.










EF058-2 


from about Lys-129 to about Gly-133, from about Gln-571 to about Tyr- 
573, from about Pro-586 to about Gly-591. 






EF065-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF066-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to
about Asp-518, from about Glu-639 to about Lys-642. 






EF067-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to
about Asp-518, from about Glu-639 to about Lys-642. 
EF073-2


from about Met-98 to about Arg-100, from about Arg-110 to about Asp-
112. 






EF074-2 


from about Ser-53 to about Tyr-59, from about Ser-86 to about Gly-88, 
from about Pro-97 to about Gln-100, from about Gln-230 to about Gly-






EF076-2 


from about Asn-3o to about Iyr-4U, irom about Asp-4o to about Asno3, 
from about Lys-79 to about Gly-81.






EF077-2


from about Arg-411 to about Gly-413.






EF078-2 


from about Thr-294 to about Gly-296, from about Asp-366 to about Gln-
368, from about Glu-524 to about Gly-526. 






EF080-2 


from about Glu-164 to about Gly-166, from about Ser-206 to about Tyr- 
208, from about Lys-239 to about Gly-243. 






EF081-2 


from about Asn-7 to about Ser-11, from about Lys-77 to about Tyr-80,
from about Lys-112 to about Asn-114, from about Gly-162 to about Asp-
164, from about Arg-181 to about Gly-183.






EF083-2 


from about Gln-38 to about Arg-40.






EF084-2 


from about Lys-140 to about Asp-142, from about Gly-164 to about Arg-
166, from about Arg-262 to about Gly-264. 






EF085-2 


from about Asn-95 to about Asp-97, from about Arg-112 to about Asp-
114, from about Asp-258 to about Ser-260, from about Arg-401 to about
Ser-403. 






EF086-2 


from about Pro-112 to about Gly-115, from about Ser-222 to about Ser-
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 






EF087-2 


from about Pro-112 to about Gly-115, from about Ser-222 to about Ser-
224, from about Asn-296 to about Gly-299, from about Thr-346 to about
Lys-348, from about Asp-428 to about Ser-432.






EF088-2 


from about Pro-112 to about Gly-115, from about Ser-222 to about Ser-
224, from about Asn-296 to about Gly-299, from about Thr-346 to about 
Lys-348, from about Asp-428 to about Ser-432. 
irom aDout Arg-z 10 aoouL Argo.






EF091-2


irom aooui vjin-'+u iu <iuuul /\5p-Hj. 






EF093-2 


from about Lys-95 to about Gly-97. 






EF094-2


from about Asp-314 to about Asp-316.






EF095-2


from about Ser-328 to about Thr-330, from about Asp-359 to about Asp-363, 
from about Glu-o3 / to about Gly-oJv, from about Asn- Ihh to about 
Gly-/4o. 






EF096-2 


from about Pro-izo to about Asn-i ju, from about Ser-iy^ to about Asp-196. 






kr\jy f-l 


from about Val-337 to about Gly-339. 






EF099-2 


from about Glu-44 to about Asp-47, from about Lys-154 to about Gly-156, 
from about Asn-zoo to about Asp-z&y. 






EF101-2 


trom about Lys-4u to about Asp-^z, irom aoout ito-zd d to aoout Asn- 
zdo, irom aooui i^ys-zoo isj douui vjiy-z^u. 






EF102-2 


from about Asp-314 to about Asp-316. 






EF103-2 


from about Asn-46 to about Gly-48. 






EF104-2 


from about Pro-232 to about Lys-237, from about Ala-362 to about Asn- 
366, from about Ser-421 to about Gly-423, from about Lys-488 to about 
Ser-490, from about Asp-550 to about Asn-552, from about Pro-637 to 
about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 
751 to about Ser-754, from about Lys-771 to about Asn-774, from about 
Ile-835 to about Asn-837, from about Pro-851 to about Gly-853. 






EF105-2 


from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, 
from about Gln-220 to about Gly-222, from about Asn-263 to about Gly-265. 






EF106-2 


from about Asp-72 to about Gly-75, from about Thr-274 to about Asp-277, 
from about Asn-310 to about Arg-313. 






EF107-2 


from about Thr-155 to about Asn-157, from about Thr-189 to about Asp-191, 
from about Arg-270 to about Gly-272, from about Thr-330 to about 
Lys-335, from about Asp-365 to about Asp-368, from about Pro-451 to 
about Asp-453, from about Gly-485 to about Thr-488. 






EF108-2 


from about Lys-142 to about Trp-145, from about Thr-147 to about Tyr- 
150, from about Arg-212 to about Gly-214, from about Ser-248 to about 
Asp-251, from about Asp-384 to about Asp-387, from about Pro-481 to 
about Arg-483, from about Lys-491 to about Gly-494, from about Thr-619 
to about Gly-624, from about Asp-656 to about Asp-659, from about 
Lys-717 to about Asn-721, from about Ser-822 to about Gly-824, from 
about Tyr-llJ/ to about Thr-iiHi. 






EF110-2 


from about Pro-123 to about Gly-127, from about Thr-223 to about Gly- 
225. 






EF111-2 


from about Lys-207 to about Asn-209, from about Asp-245 to about 
Asn-248, from about Lys-396 to about Asp-398, from about Glu-429 to 
about Ser-432, from about Thr-470 to about His-474. 






EF119-2 


from about Asp-90 to about Asn-92, from about Gln-142 to about Gly-144. 






EF121-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF122-2 


from about Asn-159 to about Asp-161, from about Asn-351 to about 
Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to 
about Ser-789. 






EF123-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly-636, 
from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-911, 
from about Lys-939 to about Glu-942, from about Asp-1074 to about Gly-1076, 
from about Asp-1367 to about Gly-1369, from about Pro-1433 to about Lys-1435, 
from about Gly-1516 to about Asp-1518, from about Lys-1656 to about Asp-1660, 
from about Lys-1860 to about Gln-1863, from about Ser-1916 to about Gln-1919, 
from about Pro-1940 to about Gly-1942. 






EF124-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly-636, 
from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-911, 
from about Lys-939 to about Glu-942, from about Asp-1074 to about Gly-1076, 
from about Asp-1367 to about Gly-1369, from about Pro-1433 to about Lys-1435, 
from about Gly-1516 to about Asp-1518, from about Lys-1656 to about Asp-1660, 
from about Lys-1860 to about Gln-1863, from about Ser-1916 to about Gln-1919, 
from about Pro-1940 to about Gly-1942. 






EF125-2 


from about Asn-331 to about Arg-336, from about Asp-634 to about Gly-636, 
from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-911, 
from about Lys-939 to about Glu-942, from about Asp-1074 to about Gly-1076, 
from about Asp-1367 to about Gly-1369, from about Pro-1433 to about Lys-1435, 
from about Gly-1516 to about Asp-1518, from about Lys-1656 to about Asp-1660, 
from about Lys-1860 to about Gln-1863, from about Ser-1916 to about Gln-1919, 
from about Pro-1940 to about Gly-1942. 






EF126-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly-352, 
from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF127-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 
352, from about Lys-415 to about Asn-418, from about Arg-446 to about 
Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to 
about Asp-518, from about Glu-639 to about Lys-642. 






EF128-2 


from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly-352, 
from about Lys-415 to about Asn-418, from about Arg-446 to about Asp-448, 
from about Asn-489 to about Lys-491, from about Ser-516 to about Asp-518, 
from about Glu-639 to about Lys-642. 






EF129-2 


from about Asn-300 to about Gly-302, from about Ser-316 to about Gly-319, 
from about Asn-ioD to about riisoo / 






EF131-2 


from about Lys-201 to about Tyr-204, from about Glu-263 to about Ser- 
266. 






EF132-2 


from about Thr-26 to about Ser-28. 
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13bis) 

A. The indications made below relate to the microorganism referred to in the description 
on page 10, line 12 

B. IDENTIFICATION OF DEPOSIT    Further deposits are identified on an additional sheet [ ] 

Name of depositary institution: American Type Culture Collection 



Address of depositary institution (including postal code and country) 

10801 University Boulevard 
Manassas, Virginia 20110-2209 
United States of America 



Date of deposit May 2, 1997 


Accession Number 55969 



For receiving Office use only 



Authorized 



This sheet was received with the international application 
What Is Claimed Is: 



1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding any one of the amino acid sequences of the 
polypeptides shown in Table 1; 

(b) a nucleotide sequence complementary to any one of the nucleotide sequences in (a); 

(c) a nucleotide sequence at least 95% identical to any one of the nucleotide sequences 
shown in Table 1; or 

(d) a nucleotide sequence at least 95% identical to a nucleotide sequence complementary 
to any one of the nucleotide sequences shown in Table 1. 

2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
hybridizes under stringent hybridization conditions to a polynucleotide having a 
nucleotide sequence identical to a nucleotide sequence in (a) or (b) of claim 1. 

3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which 
encodes an epitope-bearing portion of a polypeptide in (a) of claim 1. 

4. The isolated nucleic acid molecule of claim 3, wherein said epitope-bearing portion 
of a polypeptide comprises an amino acid sequence listed in Table 4. 

5. A method for making a recombinant vector comprising inserting an isolated nucleic 
acid molecule of claim 1 into a vector. 

6. A recombinant vector produced by the method of claim 5. 

7. A host cell comprising the vector of claim 6. 

8. A method of producing a polypeptide comprising: 

(a) growing the host cell of claim 7 such that the protein is expressed by the cell; and 

(b) recovering the expressed polypeptide. 

9. An isolated polypeptide comprising a polypeptide selected from the group 
consisting of: 

(a) a polypeptide consisting of one of the complete amino acid sequences of Table 1; 

(b) a polypeptide consisting of one of the complete amino acid sequences of Table 1 except 
the N-terminal residue; 

(c) a fragment of the polypeptide of (a) having biological activity; and 

(d) a fragment of the polypeptide of (a) which binds to an antibody specific for the 
polypeptide of (a). 

10. An isolated antibody specific for the polypeptide of claim 9. 

11. A polypeptide produced according to the method of claim 8. 

12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence selected from the group consisting of an amino acid sequence of any one of 
the polypeptides in Table 1. 

13. An isolated polypeptide antigen comprising an amino acid sequence of an E. 
faecalis epitope shown in Table 4. 

14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide 
sequence encoding a polypeptide of claim 9. 

15. A hybridoma which produces an antibody of claim 10. 

16. A vaccine, comprising: 

(1) one or more E. faecalis polypeptides selected from the group consisting of a 
polypeptide of claim 9; and 

(2) a pharmaceutically acceptable diluent, carrier, or excipient; 

wherein said polypeptide is present in an amount effective to elicit protective 
antibodies in an animal to a member of the Enterococcus genus. 

17. A method of preventing or attenuating an infection caused by a member of the 
Enterococcus genus in an animal, comprising administering to said animal a polypeptide 
of claim 9, wherein said polypeptide is administered in an amount effective to prevent 
or attenuate said infection. 

18. A method of detecting Enterococcus nucleic acids in a biological sample 
comprising: 

(a) contacting the sample with one or more nucleic acids of claim 1, under conditions 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the one or more Enterococcus 
nucleic acid sequences present in the biological sample. 
19. A method of detecting Enterococcus nucleic acids in a biological sample obtained 
from an animal, comprising: 

(a) amplifying one or more Enterococcus nucleic acid sequences in said sample using 
polymerase chain reaction, and 

(b) detecting said amplified Enterococcus nucleic acid. 

20. A kit for detecting Enterococcus antibodies in a biological sample obtained from an 
animal, comprising: 

(a) a polypeptide of claim 9 attached to a solid support; and 

(b) detecting means. 

21. A method of detecting Enterococcus antibodies in a biological sample obtained 
from an animal, comprising: 

(a) contacting the sample with a polypeptide of claim 9; and 

(b) detecting antibody-antigen complexes. 
1. [X] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claim(s) 17 is(are) directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged 
effects of the compound/composition. 



2. Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

Further defect(s) under Article 17(2)(a): 

The gene EF078 which is mentioned in Table 4, is not cited in Table 1 
and is also absent from the sequence listing. 

3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
This International Searching Authority found multiple inventions in this international application, as follows: 



1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
covers only those claims for which fees were paid, specifically claims Nos.: 



4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



See extra sheet, Invention 1. 

Remark on Protest    [ ] The additional search fees were accompanied by the applicant's protest. 

[ ] No protest accompanied the payment of additional search fees. 
FURTHER INFORMATION CONTINUED FROM PCT/ISA/210 

Inventions 7 to 41: Claims: (1-21) partially 

Idem as invention 1, but concerning EF008 to EF042 

Inventions 42 to 74: Claims: (1-21) partially 

Idem as invention 1, but concerning EF045 to EF077 

Inventions 75 to 107: Claims: (1-21) partially 

Idem as invention 1, but concerning EF079 to EF111 

Inventions 108 to 123: Claims: (1-21) partially 

Idem as invention 1, but concerning EF117 to EF132 

Invention 124: Claim: 13 partially 

An isolated polypeptide antigen comprising an amino acid 
sequence of an Enterococcus faecalis epitope of EF078 shown in 
Table 4. 

For the sake of conciseness, the first subject matter is explicitly 
defined, the other subject matters are defined by analogy thereto. 